Augmented and virtual reality

ABSTRACT

A method for creating an augmented reality scene, the method comprising, by a computing device with a processor and a memory, receiving a first video image data and a second video image data; calculating an error value for a current pose between the two images by comparing the pixel colors in the first video image data and the second video image data; warping pixel coordinates into the second video image data through the use of a map of depth hypotheses for each pixel; varying the pose between the first video image data and the second video image data to find a warp that corresponds to a minimum error value; and calculating, using the estimated poses, a new depth measurement for each pixel that is visible in both the first video image data and the second video image data.

FIELD

The field includes image capture, augmentation and displays as well as building arrangements of displays of augmented and/or virtual reality.

BACKGROUND

Virtual reality and augmented reality displays may be created and developed by large corporations for commercial use. However, everyday users may not access or develop such displays. Programming, camera integration and architecture hurdles exist for everyday users to create augmented and virtual reality platforms which can include their own pictures, videos, object selection and the like.

SUMMARY

Systems and methods here may bring creation and development tools to everyday users to allow them to create virtual and/or augmented reality scenes with their own images and/or provided images. In some embodiments, augmented reality development systems are included, using a computer processor in communication with a data storage and a network, the computer processor including instructions to receive image data over the network, map the received image data in a scene, insert tracking markers into the mapped image data scene, receive instructions over the network to insert objects into the mapped image data scene, and send, over the network, the image data scene with tracking markers and the objects to a client device for display of the image data scene. In some embodiments, the computing processor includes further instructions to receive a second image data over the network, map the second image data in a second scene, insert tracking markers into the second mapped image data scene, receive instructions over the network to insert objects into the second mapped image data scene, store the mapped image data scene and the second mapped image data scene in the data storage indicating their relationship as linked scenes, and send, over the network, the image data scene and second image data scene with tracking markers and the objects to a client device for display of the image data scene and second image data scene, wherein the display of the image data scene and second image data scene may be linked. Additionally or alternatively, the objects are animated objects. Additionally or alternatively, the objects are received over the network. Additionally or alternatively, the objects are selected from a predefined set of objects. Additionally or alternatively, the image data is a 360 degree image. Additionally or alternatively, the object includes functionality to trigger an event. Additionally or alternatively, the map of the image data includes computer instructions to generate an estimated depth map of a keyframe of the image data scene using estimated depth values of pixels in the keyframe, and generate a point cloud using the estimated depth map of the keyframe.

In some example embodiments, augmented reality systems are disclosed including using a computer processor in communication with a data storage, a camera, a network, and a display, the computer processor including instructions to receive an image data scene over the network, receive tracking markers for the image data scene, receive objects for the image data scene, and cause display of an image from the camera, the image data scene and the received objects using the tracking markers. Additionally or alternatively, the computer processor further includes instructions to receive a second image data scene over the network, receive second tracking markers for the second image data scene, receive second objects for the second image data scene, and cause display of the image from the camera, the second image data scene and the received second objects using the second tracking markers.

In some example embodiments, virtual reality development systems are disclosed including using a computer processor in communication with a data storage and a network, the computer processor including instructions to receive, over the network, image data, map the image data, receive instructions over the network to insert markers into the mapped image data, cause display, over the network, of the markers in a display of the image data, receive instructions over the network to insert objects into the mapped image data, and cause display, over the network, of the objects in the display of the image data.

Additionally or alternatively, the received image data is two dimensional image data. Additionally or alternatively, the received image data is three dimensional image data. Additionally or alternatively, other features may include object occlusion, floor plans, timelines, added annotations such as text, drawings and 3D images, display features such as projections, screens, split screens, networked viewers, live streaming, multi-user displays, audio stereo, stability, face detection, zooming, distortion, and others.

Alternatively or additionally, methods and systems are disclosed for creating an augmented reality scene, the system comprising, by a computing device with a processor and a memory, receiving a first video image data and a second video image data, calculating an error value for a current pose between the two images by comparing the pixel colors in the first video image data and the second video image data, warping pixel coordinates into the second video image data through the use of a map of depth hypotheses for each pixel, varying the pose between the first video image data and the second video image data to find a warp that corresponds to a minimum error value, and calculating, using the estimated poses, a new depth measurement for each pixel that is visible in both the first video image data and the second video image data. Some embodiments further comprise creating a map of depth hypotheses for a subset of pixels in the first video image data. Some embodiments further comprise updating the map of depth hypotheses in the first video image data with information from the new depth measurement. Some embodiments further comprise selecting the video image data as a keyframe and inserting the keyframe into a connected graph. Some embodiments further comprise analyzing the connected graph to find a globally optimal pose for the keyframe. Some embodiments further comprise using the globally optimal pose for the keyframe to correct scale drift in the second video image data. In some embodiments the calculation of the globally optimal pose is by estimating similarity transforms between the keyframe and a second keyframe. Some embodiments further comprise receiving sensor data regarding position information and processing the received sensor data into factors in the connected graph. In some embodiments the sensor data is generated by at least one of a gyroscope, an accelerometer, a compass or a GPS. Some embodiments further comprise marginalizing the connected graphs into probability distributions and using the probability distributions in estimating poses for new frames.
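
For illustration only, the pose-estimation steps recited above (warping pixels of the first image into the second using per-pixel depth hypotheses, scoring each warp by a photometric error, and keeping the pose with the minimum error) could be sketched as follows. This is a minimal sketch assuming a pinhole camera model, grayscale images and a brute-force search over candidate poses; the function names, camera model and search strategy are assumptions, not details taken from the claims.

```python
# Sketch of the pose search described above: warp pixels of the first image
# into the second using per-pixel depth hypotheses, measure the pixel-color
# (photometric) error, and keep the pose that minimizes it.
import numpy as np

def warp_and_error(img1, img2, depth, K, R, t):
    """Warp every pixel of img1 into img2 under pose (R, t) and return the
    mean absolute intensity difference over pixels landing inside img2."""
    h, w = img1.shape
    K_inv = np.linalg.inv(K)
    us, vs = np.meshgrid(np.arange(w), np.arange(h))
    pix = np.stack([us.ravel(), vs.ravel(), np.ones(h * w)])   # homogeneous pixel coords
    pts1 = (K_inv @ pix) * depth.ravel()                       # back-project with depth hypotheses
    pts2 = R @ pts1 + t[:, None]                               # transform into the second camera
    proj = K @ pts2
    z = proj[2]
    valid = z > 1e-6
    u2 = np.zeros_like(z); v2 = np.zeros_like(z)
    u2[valid] = proj[0][valid] / z[valid]
    v2[valid] = proj[1][valid] / z[valid]
    valid &= (u2 >= 0) & (u2 < w - 1) & (v2 >= 0) & (v2 < h - 1)
    if not np.any(valid):
        return np.inf
    # nearest-neighbour lookup keeps the sketch short; real systems interpolate
    diff = img1.ravel()[valid] - img2[v2[valid].astype(int), u2[valid].astype(int)]
    return float(np.mean(np.abs(diff)))

def estimate_pose(img1, img2, depth, K, candidate_poses):
    """Return the candidate pose (R, t) whose warp gives the minimum error."""
    return min(candidate_poses, key=lambda rt: warp_and_error(img1, img2, depth, K, *rt))
```

A practical system would minimize the error iteratively over a pose parameterization rather than enumerating candidates, and the resulting pose could then be used for the new depth measurements described above.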

Alternatively or additionally, methods and systems here are for creating an augmented reality scene, comprising, by a computing device with a processor and memory, receiving image data over a network, the image data being generated from a camera and including multiple frames, estimating a depth map of a keyframe of the multiple frames of the received image data using estimated depth values of pixels in the keyframe, generating a point cloud using the estimated depth map of the keyframe, and generating a 3D mesh using the generated point cloud. In some embodiments the keyframe is a frame with a depth map and a position. Some embodiments further comprise, by the computer, for non-keyframe frames, calculating a relative position to a keyframe using the depth map and position of the keyframe, and refining the keyframe depth map. In some embodiments the 3D mesh is generated by computing a normal vector for each point in the point cloud based on neighboring points, and orienting the computed normal vector of each point toward the camera pose of the keyframe that the point belongs to. In some embodiments the 3D mesh includes multiple keyframe images, wherein the multiple keyframe images are overlapping keyframes merged to a single texture. In some embodiments the merging of multiple keyframes to a single texture is by weighting a keyframe distance to a mesh surface and a keyframe angle relative to the mesh surface. Some embodiments further comprise, by the computer, trimming the 3D mesh using known data structures and applying texture to a generated 3D model. Some embodiments further comprise, by the computer, filtering the point cloud using a voxel tree to remove noise points. Some embodiments further comprise, by the computer, receiving a second image data over the network, receiving second tracking markers for the second image data, receiving second objects for the second image data, and causing display of the image, the second image data and the received second objects using the second tracking markers.
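
As an illustration of the mesh-generation steps above (back-projecting a keyframe depth map into a point cloud, computing a normal per point from neighboring points, and orienting each normal toward the keyframe's camera), a minimal sketch follows. The grid-based normal estimate, the camera-at-origin convention and the function names are assumptions made for brevity.

```python
# Sketch: keyframe depth map -> point cloud with camera-oriented normals.
import numpy as np

def depth_to_points(depth, K):
    """Back-project a dense depth map (H x W) into an H x W x 3 point grid."""
    h, w = depth.shape
    us, vs = np.meshgrid(np.arange(w), np.arange(h))
    x = (us - K[0, 2]) / K[0, 0] * depth
    y = (vs - K[1, 2]) / K[1, 1] * depth
    return np.dstack([x, y, depth])

def normals_toward_camera(points, cam_center):
    """Estimate a normal per point from neighbouring points and flip it so it
    faces the camera center of the keyframe the point belongs to."""
    dx = np.gradient(points, axis=1)            # neighbour differences along image u
    dy = np.gradient(points, axis=0)            # neighbour differences along image v
    n = np.cross(dx, dy)
    n /= np.linalg.norm(n, axis=2, keepdims=True) + 1e-12
    to_cam = cam_center - points                # vector from each point to the camera
    flip = np.sum(n * to_cam, axis=2, keepdims=True) < 0
    return np.where(flip, -n, n)
```

The oriented points could then feed a surface-reconstruction step to produce the 3D mesh, with overlapping keyframe textures merged using the distance and angle weighting described above.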

Alternatively or additionally, methods and systems here are for creating a virtual reality scene, the system comprising, by a computing device with a processor and a memory, receiving a first resolution image data, receiving a second resolution image data, segmenting an object by identifying a particular shape from pixels in the first resolution image data and the second resolution image data, and causing display of an image using both the segmented object from the first resolution image data and image data other than the segmented object from the second resolution image data.
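
A minimal sketch of this resolution-mixing display step might look like the following, assuming the segmented object is available as a boolean mask in the first-resolution frame; the nearest-neighbour resize and all names are assumptions.

```python
# Sketch: show the segmented object at the first resolution and everything
# else from the second-resolution image.
import numpy as np

def resize_nearest(img, h, w):
    """Nearest-neighbour resize of an H x W x C image to h x w."""
    rows = np.arange(h) * img.shape[0] // h
    cols = np.arange(w) * img.shape[1] // w
    return img[rows][:, cols]

def composite(first_res, second_res, object_mask):
    """object_mask: boolean H x W array in the first-resolution frame marking
    the segmented shape; the remainder of the displayed image comes from the
    (resized) second-resolution image."""
    h, w = object_mask.shape
    out = resize_nearest(second_res, h, w).copy()
    out[object_mask] = first_res[object_mask]
    return out
```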

Alternatively or additionally, methods and systems here are for creating a virtual reality scene, the system comprising, by a computing device with a processor and a memory, receiving image data, fragmenting the image data using a pattern, identifying an area of the image data to load first, associating the fragmented pattern portions with the area of the image data to load first, causing display of the fragmented pattern portions identified to load first, and causing display of a remainder of the fragmented pattern portions.
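
The prioritized loading described above could be sketched as follows: the image is fragmented into a grid of tiles, and tiles overlapping the area identified to load first are returned ahead of the remainder. The grid pattern, the rectangular priority area and the names are assumptions.

```python
# Sketch: fragment an image into tiles and order them so that tiles covering
# the "load first" area are displayed before the rest.
def tile_order(img_w, img_h, tile, priority_box):
    """Return tile rectangles (x, y, w, h), priority tiles first.
    priority_box = (x0, y0, x1, y1) is the area identified to load first."""
    first, rest = [], []
    for y in range(0, img_h, tile):
        for x in range(0, img_w, tile):
            rect = (x, y, min(tile, img_w - x), min(tile, img_h - y))
            overlaps = not (x + rect[2] <= priority_box[0] or x >= priority_box[2]
                            or y + rect[3] <= priority_box[1] or y >= priority_box[3])
            (first if overlaps else rest).append(rect)
    return first + rest
```

A viewer could fetch and display tiles in the returned order, so the area of interest appears before the remainder of the image.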

Alternatively or additionally, methods and systems here are for creating an augmented reality scene, the system comprising, by a computing device with a processor and a memory, receiving a first image data, wherein the first image data is a 360 degree image data and the computer is further configured to apply the first image data as texture to a sphere object for display, receiving a second image data, comparing the first image data and the second image data to correlate features common to both, calculating depth of the correlated features of the first and second images, and using the calculated depth of the correlated features to render a stereoscopic image display. Some embodiments further comprise, by the computer, applying a filter to the first image data and the second image data, wherein the filter identifies objects in the first and second image data and compares their positions, and merging the first and second filtered image data for display. In some embodiments the filter removes moving objects. In some embodiments the filter removes changing light conditions.
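
For illustration, depth for a feature correlated between the first and second 360° images could be recovered by triangulating the two viewing rays, given the baseline between the capture positions. The planar, azimuth-only treatment and the names below are assumptions; a full implementation would work with complete spherical directions.

```python
# Sketch: triangulate the depth of a feature seen in two 360° images whose
# capture positions are separated by a known baseline.
import numpy as np

def ray_from_azimuth(theta):
    """Unit direction in the horizontal plane for an azimuth angle (radians)."""
    return np.array([np.sin(theta), np.cos(theta)])

def triangulate(theta1, theta2, baseline):
    """Depth of a feature from camera 1, given its azimuth in both images and
    the offset of camera 2 along +x.  Returns None for (near-)parallel rays."""
    o1, o2 = np.zeros(2), np.array([baseline, 0.0])
    d1, d2 = ray_from_azimuth(theta1), ray_from_azimuth(theta2)
    A = np.column_stack([d1, -d2])          # solve o1 + s*d1 = o2 + t*d2
    if abs(np.linalg.det(A)) < 1e-9:
        return None
    s, _ = np.linalg.solve(A, o2 - o1)      # s = distance along ray 1 = depth
    return s if s > 0 else None
```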

Alternatively or additionally, methods and systems here are for creating a virtual reality scene, the system comprising, by a computing device with a processor and a memory, receiving an image data, wherein the image data is a 360 degree image data and the computer is further configured to apply the image data as texture to an object for display, receiving an indication of a first and second position in the image data, using the received first and second position in the image to define a canvas within the image data, receiving information regarding a camera height from a floor at the time the image data was captured, calculating angles from the received first and second positions and the height of the camera, and calculating distances between the first and second positions in the display of the image data. Some embodiments further comprise, by the computer, mapping objects in the display of image data and calculating angles of objects in the display of image data. Some embodiments further comprise, by the computer, receiving a floor plan for the image data, receiving placement of the received image data onto a position in the floor plan, and calculating distances between a first and a second position in the floor plan. Some embodiments further comprise, by the computer, calculating a rotation angle of the camera using the camera height, angles between the canvas and a second canvas, and the floor, and applying a correction to the calculated rotation angle of the camera to the image data for display.
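
A minimal sketch of the distance calculation described above follows, assuming each marked floor position is characterized by an azimuth and a depression angle below the horizon in the 360° image; the angle conventions and names are assumptions.

```python
# Sketch: with the camera height above the floor known, the depression angle
# of a marked floor point gives its ground distance from the camera, and the
# horizontal angle between two marked points gives the distance between them.
import math

def ground_distance(camera_height, depression_angle):
    """Horizontal distance from the camera to a floor point seen at
    depression_angle radians below the horizon."""
    return camera_height / math.tan(depression_angle)

def distance_between(camera_height, p1, p2):
    """p1, p2 = (azimuth, depression_angle) of two floor points in the 360°
    image; returns the floor distance between them (law of cosines)."""
    d1 = ground_distance(camera_height, p1[1])
    d2 = ground_distance(camera_height, p2[1])
    dazimuth = p2[0] - p1[0]
    return math.sqrt(d1**2 + d2**2 - 2 * d1 * d2 * math.cos(dazimuth))
```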

BRIEF DESCRIPTIONS

For a better understanding of the embodiments described in this application, reference should be made to the Detailed Description below, in conjunction with the following drawings in which like reference numerals refer to corresponding parts throughout the figures.

FIG. 1a is an illustration of the overall system architecture in accordance with certain aspects described herein;

FIG. 1b is an illustration of the components and functionalities contained in the system in accordance with certain aspects described herein;

FIG. 1c is a UML class diagram illustrating the structure of a Holo containing one or more AR and/or VR scenes in accordance with certain aspects described herein;

FIG. 1d is a flowchart illustrating the workflow for accessing the system in accordance with certain aspects described herein;

FIG. 1e is a flowchart illustrating the method for creating a Holo containing one or more AR and/or VR scenes in accordance with certain aspects described herein;

FIG. 1f is a flowchart illustrating the method for creating or selecting an AR or VR scene within a Holo in accordance with certain aspects described herein;

FIG. 1g is a flowchart illustrating the method for adding a 2D or 3D object to an AR or VR scene or selecting and editing an existing 2D or 3D object within an AR or VR scene in accordance with certain aspects described herein;

FIG. 1h is a flowchart illustrating the method for adding an animation to a 2D or 3D object or editing an existing animation associated with a 2D or 3D object in accordance with certain aspects described herein;

FIG. 1i is a flowchart illustrating the method for adding a triggerable action to a 2D or 3D object or editing an existing triggerable action associated with a 2D or 3D object in accordance with certain aspects described herein;

FIG. 1j is a flowchart illustrating the method for saving a Holo containing one or more AR and/or VR scenes in accordance with certain aspects described herein;

FIG. 1k illustrates a user interface for creating a new Holo containing one or more AR and/or VR scenes in accordance with certain aspects described herein;

FIG. 1l illustrates a user interface for creating and editing AR and VR content in accordance with certain aspects described herein;

FIG. 1m illustrates a user interface for creating a new AR or VR scene within a Holo in accordance with certain aspects described herein;

FIG. 1n illustrates a user interface for adding 2D and 3D objects to an AR or VR scene within a Holo in accordance with certain aspects described herein;

FIG. 1o illustrates a user interface for adding an animation to a 2D or 3D object or editing an existing animation associated with a 2D or 3D object in accordance with certain aspects described herein;

FIG. 1p illustrates a user interface for adding a triggerable action to a 2D or 3D object or editing an existing triggerable action associated with a 2D or 3D object in accordance with certain aspects described herein;

FIG. 1q illustrates a user interface for preprocessing a 360° image before it is added to a VR scene within a Holo in accordance with certain aspects described herein;

FIG. 1r illustrates a user interface for preprocessing a 3D object before it is added to an AR or VR scene within a Holo in accordance with certain aspects described herein;

FIG. 1s illustrates a user interface for creating or editing a custom animation associated with a 2D or 3D object contained in an AR or VR scene within a Holo in accordance with certain aspects described herein; and

FIG. 1t illustrates a user interface for consuming previously created AR and/or VR content directly in the web browser in accordance with certain aspects described herein.

FIG. 2a is an abstract system diagram representing the interaction of certain methods with the embedding system as well as with the different types of users in accordance with certain aspects described herein;

FIG. 2b is a flowchart diagram of an exemplary embedding system for certain methods showing the general workflow in accordance with certain aspects described herein;

FIG. 2c is a system diagram representing the exemplary use of certain methods for the construction-based use case in accordance with certain aspects described herein;

FIG. 2d is a flowchart diagram of the system process for importing 360° images and videos to a timeline-based scene in accordance with certain aspects described herein;

FIG. 2e is a flowchart diagram of the system process for creating and editing the timeline of 360° images and videos in accordance with certain aspects described herein;

FIG. 2f is a flowchart diagram of the system process for adding new timeline-elements to an existing timeline of 360° images and videos in accordance with certain aspects described herein;

FIG. 2g is a flowchart diagram of the system process for adding 2D and 3D content to an existing location- or timeline-based scene of the timeline of a 360° image or video in accordance with certain aspects described herein;

FIG. 2h is a system diagram of the interaction process of the existing embedding system and the timeline system for the processing of the 360° images and video in accordance with certain aspects described herein;

FIG. 2i is an illustration of one possible implementation for expanding the timeline panel for a chosen location in accordance with certain aspects described herein;

FIG. 2j is an illustration of one possible implementation of the expanded timeline of a location at time 0 in accordance with certain aspects described herein;

FIG. 2k is an illustration of one possible implementation of the user interaction to expand the timeline panel to create a new time-based scene for an existing timeline in accordance with certain aspects described herein;

FIG. 2l is a partial diagram of the modal for creating a new time-based scene in an expanded timeline as represented in FIG. 2k in accordance with certain aspects described herein;

FIG. 2m is a partial diagram of the modal for creating a new time-based scene in an expanded timeline as represented in FIG. 2k in accordance with certain aspects described herein;

FIG. 2n is a partial diagram of the modal for creating a new time-based scene in an expanded timeline as represented in FIG. 2k in accordance with certain aspects described herein;

FIG. 2o is a partial diagram of the modal for creating a new time-based scene in an expanded timeline as represented in FIG. 2k in accordance with certain aspects described herein;

FIG. 2p is a partial diagram of the modal for creating a new time-based scene in an expanded timeline as represented in FIG. 2k in accordance with certain aspects described herein;

FIG. 2q is a partial diagram of the modal for creating a new time-based scene in an expanded timeline as represented in FIG. 2k in accordance with certain aspects described herein;

FIG. 2r is a diagram of one possible implementation showing the time-based scene newly created in FIG. 2k to FIG. 2q and the corresponding timeline of the exemplary location at a subsequent time in accordance with certain aspects described herein;

FIG. 2s is a partial diagram of the modal for the user interaction to change the scene settings of a time-based scene in an expanded timeline as represented in FIG. 2s in accordance with certain aspects described herein;

FIG. 2t is an illustration of one possible embodiment of the viewer for timeline-based VR tours in accordance with certain aspects described herein;

FIG. 2u is an illustration of the expanded timeline navigation panel of the viewer represented in FIG. 2t in accordance with certain aspects described herein;

FIG. 2v is an illustration of the expanded timeline navigation panel of the viewer represented in FIG. 2t at a subsequent time on the timeline in accordance with certain aspects described herein;

FIG. 2w is a more space-saving alternative illustration of one possible implementation of the opened timeline as a dropdown of a chosen location at time 0 in accordance with certain aspects described herein;

FIG. 2x is an illustration of an alternative embodiment of the viewer for timeline-based VR tours in accordance with certain aspects described herein;

FIG. 2y is an illustration of the expanded dropdown navigation panels of the viewer represented in FIG. 2x in accordance with certain aspects described herein;

FIG. 3a illustrates the overall process of adding a Floor Plan to an existing or newly created Holo in accordance with certain aspects described herein;

FIG. 3b illustrates the process of importing a Floor Plan from various sources and formats in accordance with certain aspects described herein;

FIG. 3c illustrates the process of interconnecting a selected Scene with a Location on an imported Floor Plan in accordance with certain aspects described herein;

FIG. 3d illustrates the process of extracting a high definition version of a Floor Plan from a document after applying transformations like cropping or rotation in accordance with certain aspects described herein;

FIG. 3e illustrates the user interface of the Editor editing a Holo including Floor Plans in accordance with certain aspects described herein;

FIG. 3f illustrates an exemplary user interface of the Editor for the import of a Floor Plan in accordance with certain aspects described herein;

FIG. 3g illustrates an exemplary user interface for Hotspot navigation and creation on an enlarged Floor Plan in accordance with certain aspects described herein;

FIG. 3h illustrates an exemplary user interface for representation and addition of orientation to Hotspots on a Floor Plan in accordance with certain aspects described herein;

FIG. 4a is an illustration of how the virtual camera is adjusted in multiple panoramic images in accordance with certain aspects described herein;

FIG. 5a illustrates an exemplary way to apply a position to a photo in accordance with certain aspects described herein;

FIG. 5b illustrates how added locations could be visualized in accordance with certain aspects described herein;

FIG. 5c illustrates an exemplary use case where this method can be used in accordance with certain aspects described herein;

FIG. 5d illustrates the possibility of annotating photos in accordance with certain aspects described herein;

FIG. 6a illustrates a possible visualization of the canvas-creation-mode and the resulting features, including but not limited to measuring distances and angles as well as extracting and projecting surfaces, in accordance with certain aspects described herein;

FIG. 7a illustrates an HTML 2D overlay over a 3D scene, which enables the creator of a Holo to create any HUD for the player mode in accordance with certain aspects described herein;

FIG. 8a is an illustration of the underlying concept of the annotation and task system located in 360° images in accordance with certain aspects described herein;

FIG. 8b is a flowchart illustrating the workflow of creating a new annotation in accordance with certain aspects described herein;

FIG. 8c is a flowchart illustrating the workflow of creating a new annotation from a preselection of objects in a Holo in accordance with certain aspects described herein;

FIG. 8d is a flowchart illustrating the workflow of being notified of, processing and resolving an annotation in accordance with certain aspects described herein;

FIG. 8e is an exemplary illustration of a synchronized annotation list embedded into a Holo in accordance with certain aspects described herein;

FIG. 8f is an exemplary illustration of the synchronization of a resolved task in accordance with certain aspects described herein;

FIG. 8g is an illustration of the difference between global and local annotation lists in accordance with certain aspects described herein;

FIG. 9a is an illustration of exemplary free-form strokes created with the painting tool in accordance with certain aspects described herein;

FIG. 9b is an illustration of exemplary predefined geometric figures created with the painting tool in accordance with certain aspects described herein;

FIG. 10a is an illustration of the view for the user in the player mode for a Holo with other users added to the scene as part of the multi-user-experience in accordance with certain aspects described herein;

FIG. 10b is an illustration of the visualization of the user in a 360° scene looking at a specific location in this 360° scene and focusing on a specific user as part of the multi-user-experience in accordance with certain aspects described herein;

FIG. 11a is an illustration that visualizes audio sources in a scene in accordance with certain aspects described herein;

FIG. 11b is a flow chart that visualizes an exemplary import process for audio sources in accordance with certain aspects described herein;

FIG. 12a is an illustration of a 360° live stream watched and annotated by one or more users at the same time in accordance with certain aspects described herein;

FIG. 13a is an example of an undistorted view of the visual content in a Holo in accordance with certain aspects described herein;

FIG. 13b is an example of a circular fisheye view of the visual content in a Holo in accordance with certain aspects described herein;

FIG. 13c is an example of a Cartesian fisheye view of the visual content in a Holo in accordance with certain aspects described herein;

FIG. 13d is a flowchart-like diagram illustrating steps and features of systems and/or methods that permit a user to alternate between an undistorted overview of visual content in a Holo and a fisheye view of the visual content in accordance with certain aspects described herein;

FIG. 14a is an example of a possible UI for a loading screen in the player mode of an AR/VR editor and player in accordance with certain aspects described herein;

FIG. 15a is an illustration of the automatic face detection in the VR scene in accordance with certain aspects described herein;

FIG. 16a illustrates automatic dynamic rendering resolution adjustment to keep a stable framerate in accordance with certain aspects described herein;

FIG. 17a is an exemplary illustration of a tiled 360° image for use in a Holo in accordance with certain aspects described herein;

FIG. 17b is an illustration of a low-resolution single-tile 360° image overlaid with an exemplary tiled high-resolution version of the same 360° image for use in a Holo in accordance with certain aspects described herein;

FIG. 17c is an architecture diagram illustrating the computer architecture for uploading, preprocessing, storing and delivering tiled 360° images in accordance with certain aspects described herein;

FIG. 17d is a flowchart illustrating the process of receiving a 360° image, creating a low-resolution and a high-resolution tiled version thereof and storing them in accordance with certain aspects described herein;

FIG. 17e is a flowchart of delivering a tiled 360° image for display in a Holo in accordance with certain aspects described herein;

FIG. 18a is an illustration of the computer architecture for uploading an object to be used in a Holo and checking for an existing hash value in accordance with certain aspects described herein;

FIG. 18b is an illustration of the process of uploading an object to be used in a Holo and checking for an existing hash value in accordance with certain aspects described herein;

FIG. 19a is an illustration of an asynchronous web component running locally in the browser of the client to import and process 3D models of different formats locally without the need to upload them or the need for an active internet connection in accordance with certain aspects described herein;

FIG. 20a is an illustration of an automatic mesh simplification and texture reduction, rescaling and adjusting based on remaining memory when a 3D model is loaded and rendered on mobile devices in accordance with certain aspects described herein;

FIG. 21a is a diagram that describes a method that can apply a user's rotation to consecutive images in accordance with certain aspects described herein;

FIG. 21b is an illustration that shows a user's field of view without any rotation of the user in accordance with certain aspects described herein;

FIG. 21c is an illustration of what should happen to the user's field of view if a user rotates in accordance with certain aspects described herein;

FIG. 21d is an illustration of how stabilization of the field of view can be beneficial in accordance with certain aspects described herein;

FIG. 22a is an illustration of how depth estimation can be used to correctly display a panoramic image on a stereoscopic device in accordance with certain aspects described herein;

FIG. 23a visualizes how videos can be enhanced by additional elements in accordance with certain aspects described herein;

FIG. 24a is an illustration of the overall tracking system in accordance with certain aspects described herein;

FIG. 24b is an illustration of the benefits of the method by enabling long distance tracking in accordance with certain aspects described herein;

FIG. 24c is an illustration of how the pose of detected objects is determined in accordance with certain aspects described herein;

FIG. 25a is a flowchart illustrating the workflow of the overall system in accordance with certain aspects described herein;

FIG. 25b is an illustration of the first use case of the embodiments in accordance with certain aspects described herein;

FIG. 25c is an illustration of the second use case of the embodiments in accordance with certain aspects described herein;

FIG. 26a is a flowchart of a 3D marker generation using a geometry mesh in accordance with certain aspects described herein;

FIG. 27a is a flowchart of a pipeline of a mesh generation in real time and its usage in accordance with certain aspects described herein;

FIG. 28a shows the general process of 360° image fusion in accordance with certain aspects described herein;

FIG. 28b depicts a series of 360° images containing people and noise in accordance with certain aspects described herein;

FIG. 28c depicts a series of 360° images with different lighting exposures in accordance with certain aspects described herein;

FIG. 29a is an exemplary illustration of a spherical or panorama camera with at least two lenses in accordance with certain aspects described herein;

FIG. 29b is an exemplary illustration of a rig construction of at least four cameras to shoot spherical or panoramic images in accordance with certain aspects described herein;

FIG. 29c illustrates a device for 3D scanning in accordance with certain aspects described herein;

FIG. 29d illustrates a graphics tablet for digital drawing in accordance with certain aspects described herein;

FIG. 30a illustrates a computing device in accordance with certain aspects described herein;

FIG. 30b is an exemplary illustration of a head-mounted device using a smart device for rendering in accordance with certain aspects described herein;

FIG. 30c is an exemplary illustration of a head-mounted device with a built-in display system in accordance with certain aspects described herein;

FIG. 30d is an exemplary illustration of a head-mounted device with a smart device and a reflective surface in accordance with certain aspects described herein;

FIG. 30e is an exemplary illustration of a head-mounted device with a projector and a reflective surface in accordance with certain aspects described herein; and

FIG. 30f is an exemplary illustration of a head-mounted device with a projector in accordance with certain aspects described herein.

FIG. 31a is an exemplary illustration of an augmentation system comprising a projector unit, sensors and a smart device;

FIG. 31b is an exemplary illustration of a head-mounted device with a built-in projector unit, sensors and a computing unit; and

FIG. 31c is an exemplary illustration of the system that aligns the virtual projected content with the physical space.

DETAILED DESCRIPTION

Reference will now be made in detail to embodiments, examples of which are illustrated in the accompanying drawings. In the following detailed description, numerous specific details are set forth in order to provide a sufficient understanding of the subject matter presented herein. But it will be apparent to one of ordinary skill in the art that the subject matter may be practiced without these specific details. Moreover, the particular embodiments described herein are provided by way of example and should not be used to limit the scope of the particular embodiments. In other instances, well-known data structures, timing protocols, software operations, procedures, and components have not been described in detail so as not to unnecessarily obscure aspects of the embodiments herein.

Overview

Three dimensional virtual reality and augmented reality may be created using modern computer technologies. Users may wear a headset device in order to receive two images, one directed toward each eye, which together may create the illusion of three dimensional space and depth. Additionally or alternatively, certain embodiments may cause the display to move depending on interaction from the user; for example, if sensors in the headset determine that the user is moving her head, the display is changed to coordinate with the movement. In this way, a user may feel as if she is actually in the virtual computer generated or augmented space.

Virtual reality may refer to images presented to a user which are all computer generated. In some examples, when a user moves her head, the virtual reality display moves in conjunction, in order to give the impression that the user is in another place, a virtually real place, which may be interacted with using various sensors. In some embodiments, a user may be presented with camera images of another place that may be recorded, live, or near live. The user may interact in the same way as described, with augmented reality features laid into the display. For example, a user in Utah may wear a virtual reality headset in order to feel like he is at a baseball game in Florida. He may move his head to view different angles and see different parts of the field as if he were really in Florida. Computer generated items or displays may be laid into the images he views in order to show him statistics of the game, replays of earlier action, or broadcasters and pundits who comment during the game. Augmented reality may refer to images presented to a user which are based on camera images with additional computer generated graphics. The camera images may be taken from the place the user is located, for example by a smartphone that is also used to generate the images for viewing. These camera images may be augmented with computer generated images in order to add an item or aspect to what the user would see without the technology. For example, a user wearing an augmented reality headset may see a camera image of an office, but instead of a window, the computer generates an image of outer space. Or instead of seeing a coffee mug on their desk, the user sees a cartoon character on her desk instead. Again, these images may be manipulated and interacted with, but are computer generated. Another way augmented reality content can be consumed is by using a head-mounted device to augment the user's field of view with additional virtual content overlaid, perspectively correct, on the physical scene. In this case the recorded camera image is not necessarily shown to the user. Another term used in the art is mixed reality. Mixed reality may refer to augmented reality but imposed on a place with physical objects in it. For example, the user is using devices that have a camera through which they view the actual physical world around them, but the computer changes the color of the desk to blue. The user may reach out and touch the physical desk because they are in the room with the desk, but through the computer generated images, the desk color has changed. In this disclosure, terms such as virtual reality, augmented reality and mixed reality may be used interchangeably and are not intended to be limiting.

One problem with VR and AR is that building or modifying scenes for viewing may be inaccessible to regular users. These regular users may not be able to create their own VR or AR applications without significant knowhow in the software engineering space. Disclosed here are methods and systems including corresponding web-based platforms for enabling average users without specific knowledge in programming and/or design to create AR and/or VR content. The platform lets users create Holos, or one or more AR and/or VR scenes. These scenes may be created by the user and stitched together by the systems and methods here in order to provide an overall experience to the user.

Through the systems described here, AR scenes may be created by dragging and dropping into the platform running in the web browser either 2D or 3D markers for tracking as well as one or more 2D or 3D objects. VR scenes may be created by dragging and dropping into the platform running in the web browser a 360° spherical image or video and/or one or more 2D or 3D objects. Besides the ability to import arbitrary 2D or 3D objects, the platform may also provide a selection of predefined 2D and 3D objects that can be added to an AR or VR scene without the requirement of further resources. In some embodiments, all objects present in a scene may be manipulated, animated and associated with additional information as well as actions such as transferring to a different scene or opening a website, and all created Holos may be saved persistently and consumed in the dedicated player mode of the platform directly in the browser and may be reachable via a URL.

Once the scenes are stitched together, the user may interact by viewing one scene at a time, and navigating to the next scene with interactions such as clicking an arrow in the scene to move to the next scene. Additionally or alternatively, in some embodiments, sensors in user equipment may be used to identify location, movement and/or orientation. Such information may also be used to navigate through scenes created by the systems. In such a way, a user may feel as if she is walking down a street, panning the camera or moving a headset to view various angles including but not limited to 360° around and even up and down, relative to the user as discussed in detail below. It should be noted that in this disclosure, the term “Holo” may be used to refer to one or more AR/VR scenes that may be stitched together. The term is not intended to be limiting, but merely describes a holographic, or Holo, scenario, which is not necessarily either an AR or a VR scene, but may be any.

Thus, the systems and methods here may, alternatively or in any combination, a) enable the creation of both AR and VR content using the same web platform; b) give average users without specific skills in design and/or programming (end-user design) the ability to compose AR and VR content via drag and drop; c) provide a completely web-based system to carry these out; d) import and use arbitrary 2D and 3D objects, and also choose from a set of predefined objects; and/or e) create VR scenes directly from 360° spherical images or videos.

It should be noted that computing resources that may be used to carry out what is described here could be any number of devices including but not limited to a desktop, laptop, tablet, phablet, smartphone, or wearable such as glasses or helmets, or other smart computing devices. Wireless communication connections could be any number of things including cellular, WiFi, Near Field Communications, Bluetooth, Pico cell, Nano cell, or any other kind of communication protocol, standard, or method, even those invented after this disclosure was written.

Architecture Examples

FIG. 1a is an illustration of an example overall system architecture. The example methods and systems here are based on a computer architecture comprising a computer 110104 such as a personal computer or mobile computer such as a smartphone, including any number of devices using any number of operating systems 400122 on the client-side, and a server 110114 and data store 110116 on the server-side, which are connected by a communications network 110112. The computing systems, including the server, data store, and client personal computer, may each include components such as a processor, memory, random access memory (RAM), data storage, distributed data storage, network connection, and other peripherals such as still cameras, video cameras, microphones, speakers, infrared cameras, depth mapping cameras, light detection and ranging (LIDAR) systems, radar systems, global positioning systems (GPS), acoustical sensors and mapping devices, etc., where the memory and processors may run any number of software programs as disclosed here on an operating system.

The system implementing the methods here may be delivered by the server 110114 as a service to the user 110102 that they can access using their web browser 110108 with an embedded 3D engine 110110. The provided service may be delivered in terms of static web pages 110124, client-side scripts 110122 and/or dynamic web pages generated by server-side programs 110118 and scripts 110120 that reside in the data store 110116 on the server-side. In some embodiments, in order to effectively use the service, the user 110102 may have a user profile 110126. Their created Holos 110128 containing one or more AR and/or VR scenes may be persistently stored in the data store 110116, along with any imported 2D and/or 3D models 110117. Predefined 2D and 3D models provided by the service 110117 may reside on the data store 110116 as well.

FIG. 1b illustrates example components and functionalities additionally or alternatively contained in the system. The methods and systems comprise a number of components and functionalities that may be distributed between the server and the client side. Both the Server 110204 and the Client 110202 comprise technical devices 400106. The server side 110204 may direct actions involving direct communication with the data store 110116, i.e., user management 110226, e.g., registering an account, logging in etc., and saving/loading Holos and the AR/VR scenes and 2D/3D objects contained therein 110228. On the client side 110202, the user 110102 may interact with the editor 110232 that enables the user to create AR/VR content and the player 110230 that enables them to consume previously created AR/VR content. In particular, the editor 110232 enables the user 110102 to create 110216 and edit 110214 Holos, create 110212 and edit 110210 AR and VR scenes contained in Holos, add 2D/3D objects 110208 to AR/VR scenes and edit existing ones 110206, add animations 110224 to 2D/3D objects and edit existing ones 110222, and add triggerable actions 110220 to 2D/3D objects and edit existing ones 110218. Further, a computing system may be used for viewing the AR/VR scenes once built, as described in detail below.

Holo Overview

Additionally or alternatively, the systems and methods here may support certain Holo building and displaying experiences. FIG. 1c illustrates an example structure of a Holo 110302 created using a computing device 400106. It comprises several scenes 110304 that are either AR 110306 or VR 110308 scenes, where one Holo can contain scenes of both types and may contain at least one scene of any type. In the example, one AR scene 110306 contains a certain number of dedicated tracking markers 110310 that may be either a 2D or 3D object. The tracking marker 110310 may act as the virtual connection to the real world in an AR context, i.e., the created AR content is displayed relative to that marker in a see-through scenario where a user is viewing their own surroundings through a camera or see-through device, and the systems are incorporating a marker on a physical object detected and tracked as described herein. For instance, if one wants to augment his real-world laptop with virtual objects in an AR scenario, either a 2D image of the keyboard of the laptop or a 3D scan of the laptop could act as markers. In the editor 110232, virtual 2D 110316 and/or 3D objects 110318 would then be placed in relation to the imported marker (which could be accomplished via drag and drop, or other interaction for example) as they should appear relative to the real laptop in the AR scenario independent of the position of the camera. A 3D scan can be created with a 3D scanning device 390302, the image with a camera 400108. An AR scene 110306 without a marker is an empty scene and can be used to create a fully virtual world by importing any number of 2D 110316 and/or 3D objects 110318. One VR scene 110308 contains one 360° spherical image or one 360° spherical video 110314 as the basis. Spherical and panoramic images or videos can be created using a spherical or panorama camera 390102 or using rigs of multiple cameras 390200. That is, the scene is automatically initialized to contain a sphere on whose inside the 360° image/video is placed as the texture. The virtual camera through which the user 110102 consumes the VR scene is then placed at the center of the sphere, so that they have the illusion of standing at the place where the image or video was shot. Any number of objects 110312 can be imported into a scene, in terms of either 2D 110316 or 3D 110318 objects. For instance, in this way a virtual chair and table given as 3D objects can be placed within the 360° image of a living room, which is a potential use case for interior designers. Any number of objects 110312 can be animated 110322 and associated with actions 110320, such as transferring to a different scene or opening a website, which are triggered when the object is clicked.
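
Purely as an illustration of the Holo structure just described (and of the UML class diagram of FIG. 1c), the containment of scenes, markers, objects, animations and actions might be modeled as follows; the class and field names are assumptions, not part of the described system, and the reference numerals in the comments refer to the figure discussed above.

```python
# Illustrative model of the Holo structure described above (cf. FIG. 1c).
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class SceneObject:            # a 2D (110316) or 3D (110318) object
    name: str
    is_3d: bool
    animations: List[str] = field(default_factory=list)   # 110322
    actions: List[str] = field(default_factory=list)      # 110320, e.g. "open website"

@dataclass
class ARScene:                # 110306
    marker: Optional[SceneObject] = None                  # 110310; None = empty AR scene
    objects: List[SceneObject] = field(default_factory=list)

@dataclass
class VRScene:                # 110308
    spherical_media: str = ""                              # 360° image/video 110314
    objects: List[SceneObject] = field(default_factory=list)

@dataclass
class Holo:                   # 110302: contains one or more AR and/or VR scenes
    name: str
    scenes: List[object] = field(default_factory=list)     # ARScene or VRScene (110304)
```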

Web Based System Access Examples

Additionally or alternatively, FIG. 1d illustrates an example workflow to access the system. In this example, a user can open a web browser 110402 on a computing machine 400106. Without the need to install additional software, the system can be accessed by navigating to a URL that identifies the platform 110404. The system can be accessed with the help of a web browser 110402 although the system can run on a different computing machine 400106 than the one the user is using. Thus, as shown in FIG. 1a, the servers 110114 and data storage of the systems 110116 may be accessed over a network 110112.

Holo Creation and Loading Overview Examples

FIG. 1e 110500 describes an example workflow of creating or loading a Holo using a computing device 400106. If the user 110102 intends to create a Holo 110504 on a computing machine 400106, they have to decide on the type of the first scene contained in the new Holo 110506, as a Holo may contain at least one AR or VR scene. The initial scene is then created by the editor 110232 in terms of either an AR 110510 or VR 110508 scene template. If the user 110102 intends to instead load an existing Holo, they may choose from a list of Holos 110512 delivered by the server 110114 based on the Holos 110128 associated with the user's profile 110126 in the data store. The selected Holo may then be loaded 110514 and displayed by the editor 110232.

FIG. 1f 110600 illustrates an example method for creating or selecting an AR or VR scene within a Holo using a computing device 400106. If the user 110102 intends to create 110602 a new scene within a Holo, they may choose between creating an AR or VR scene 110604 or selecting a scene from a list of scenes 110612. In case an AR scene is created, the user may decide whether or not a marker should be used 110606. If a marker should be used, it may be imported in terms of a 2D or 3D object 110608. In case a VR scene should be created, a 360° spherical image or 360° spherical video may be imported 110610. In contrast to AR scenes, it may not be possible to create an empty VR scene because an empty VR scene has no camera input to display like an AR scene would. If the user, instead of creating a new scene, decides to select an existing one, they may have to choose from the list of scenes 110612 contained in the Holo they are currently editing.

FIG. 1g 110700 illustrates an example workflow of adding a 2D or 3D object to an AR or VR scene contained in a Holo, or editing one, using a computing device 400106. In this example, first, a scene may be selected from the list of existing scenes 110702 in the Holo the user 110102 is currently editing. In case the user wants to add a new object 110704, they may decide 110706 between choosing from a set of predefined 2D and 3D objects provided by the web platform 110708 or importing a custom object 110710. The latter happens by an interaction such as, for example, dragging and dropping one or more 2D and/or 3D object files into the web platform. In case the user 110102 wants to edit an existing object rather than creating a new one, they can do so by first selecting the object 110712 either directly within the scene currently displayed by the editor 110232 or choosing from the list of objects for that scene. Then, the object may be edited 110714 in any number of ways including but not limited to position, size, rotation, color, and/or texture.

FIG. 1h 110800 illustrates an example method for adding an animation to a 2D or 3D object or editing an existing animation associated with a 2D or 3D object using a computing device 400106. The process of adding or editing an animation for a 2D or 3D object may start with selection of the target object 110802 from the scene currently displayed in the editor 110232. Subsequently, the user 110102 may decide on whether they want to add a new animation or edit an existing one 110804. In examples where they want to add an animation to the object 110806, they can select from a list of given animations or compose a custom animation from sequences of rotating and/or scaling and/or repositioning the object. In some example embodiments, editing an existing animation may only be possible if there is already an animation associated with the target object 110808. If an animation exists, it can either be manipulated or replaced 110810. If the user 110102 decides to replace the existing animation, it may be deleted 110812 and they select or compose a new one 110806. Otherwise, the existing animation may be kept, but altered by the user 110814.

FIG. 1i 110900 describes an example workflow of adding or editing a triggerable action associated with a 2D or 3D object contained in an AR or VR scene using a computing device 400106. A triggerable action may be a programmed animation or change that is only imparted upon some programmed trigger event. For example, an animated object may change color if the user comes within a predetermined distance of the object. First, in the example, the target object in the scene currently displayed in the editor 110232 has to be selected 110902. Subsequently, the user 110102 may decide on whether a new action should be associated with the target object or whether an existing action should be edited 110904. In case they want to add a new triggerable action to the object, they can select from a list of given actions 110906, set the parameter of the action (e.g., a URL for an “open website” action) 110908 and finally add the action to the target object 110910. Editing an existing action is only possible if there is already an action associated with the target object 110912. If an action exists, it can either be manipulated or replaced 110914. If the user 110102 decides to replace the existing action, it may be deleted 110916 and they select 110906, define 110908 and add 110910 a new one. Otherwise, the existing action may be kept, but its parameters may be altered by the user 110918.
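
As an illustration of such a triggerable action, the following sketch binds an action with a parameter (such as a URL for an “open website” action) to a proximity trigger like the color-change example above. The class, the per-frame update call and all names are assumptions; the source also describes click-based triggers.

```python
# Sketch: an action bound to an object fires when its trigger condition is
# met, here the viewer coming within a predetermined distance of the object.
import math

class TriggerableAction:
    def __init__(self, parameter, trigger_distance, on_trigger):
        self.parameter = parameter              # e.g. a URL (cf. 110908)
        self.trigger_distance = trigger_distance
        self.on_trigger = on_trigger            # callback that performs the action

    def update(self, object_position, user_position):
        """Call once per frame; fires the action when the user is close enough."""
        if math.dist(object_position, user_position) <= self.trigger_distance:
            self.on_trigger(self.parameter)

# usage: change the object's color when the user comes within 2 units
action = TriggerableAction("#0000ff", 2.0, lambda color: print("set color", color))
action.update((0.0, 0.0, 0.0), (1.0, 1.0, 1.0))
```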

FIG. 1j 111000 describes an example process of saving a Holo using a computing device 400106. In case the Holo is saved for the first time 111002, the user 110102 may provide the Holo a name and optional description 111004. Subsequently, the Holo including all scenes, objects, and metadata may be transferred to the server 110204 by the editor 110232, where it is assigned a URL (if saved for the first time) 111006 and finally saved 111008 to the data store 110228.

Scene Builder User Interface Examples

In certain example embodiments, a user interface may be used to begin to build and edit a Holo as described herein. The example user interface 111100 may be generated by the systems described here and accessed over a network. Thus, a user may access, build and edit a scene from wherever network access is provided. Additionally or alternatively, FIG. 1k depicts an example user interface 111100 including options for creating 111102 and loading 111104 a Holo which can be viewed on a computing device 400106. A new Holo may be created by choosing the type of the first scene, i.e., either VR 111106 or AR 111110. The tile labeled 111108 implements the functionality for drag-and-drop interaction required by the methods and systems here. Clicking the tab labeled 111104 shows a list of existing Holos for a logged-in user. From that list, a Holo can be selected and loaded. Existing Holos 110128 are loaded from the data store 110116 and delivered by the server 110114.

FIG. 1l depicts a user interface 111200 of the editor 110232 which may be used for creating AR and VR content, using the methods and systems described here. The user interface 111200 can be viewed and used with a computing device 400106 over a network as described here. Through this main editor interface 110232 a user may build out a scene using new or imported scenes and/or objects.

In the user interface 111200 example shown, a left-hand sidebar is included containing a list of scenes (referred to as “slides” in this case) 111202 in the current Holo as well as an option for creating new scenes, shown with a plus symbol 111210; also shown are a large area for editing the currently selected scene 111206; a right-hand sidebar containing various options for selecting, adding and enhancing objects 111216; and a top menu bar containing options for saving a Holo and editing its metadata 111204 as well as for user management and switching to the scene player mode 111214. Scenes within a Holo may be selected from the list labeled 111202. Objects 111208 can be added to a scene by interactions such as but not limited to dragging and dropping them into the central editor area 111206 or by activating the corresponding tab in the right-hand sidebar and choosing a predefined object 111400. 3D objects can either be imported in terms of a single object file (e.g., in the formats DAE or OBJ), which then triggers an additional dialog asking for the corresponding material and texture files, or in terms of a ZIP file containing all necessary files at once. Once imported or created, all objects can be manipulated using the buttons labeled 111212 (reposition, rotate, scale). The current position, rotation and size of the selected object are given at the bottom of the editor area 111220, separately for each dimension. The buttons labeled 111222 are for zooming, duplicating the currently selected object and deleting the currently selected object. Any kind of object manipulation buttons in any combination may be presented for use in the editor 111200. The tab in the right-hand sidebar 111216 that is activated in FIG. 1l (the leftmost tab) shows the list of all objects in the current scene 111218. Objects can be selected by either clicking directly on them 111208 in the main editor area 111206 or by choosing them from that list 111218. In the example, the non-active tabs of the right-hand sidebar 111216 in FIG. 1l are (from left to right) for adding objects 111400, adding/editing animations 111500 and adding/editing actions 111600 and are described below. It should be noted that any layout could be used, additionally or alternatively to the examples listed here. Screens, menus, options, and layouts could all be placed on any kind of user interface in any kind of arrangement, time sequence or window arrangement.

FIG. 1m depicts one possible implementation, additionally or alternatively, of a user interface 111300 for creating a new AR or VR scene within a Holo (referred to as "new slide" in this case). It can be viewed and used with a computing device 400106. In the example, the user 110102 can choose between VR 111302 and AR 111304 scenes. The area labeled 111306 contains the three options for choosing either no marker, a 2D marker or a 3D marker for the new AR scene (cf. FIG. 1f). Accordingly, the two tiles to the right implement the drag-and-drop interaction used by the methods and systems here. When choosing the tab for VR scenes 111302, the user 110102 is presented with the same interface as shown in FIG. 1k 111108. Furthermore, there is an option to add 2D scenes 111308 containing only text or checklists, which is, however, secondary in the context of AR and VR.

FIG. 1n shows an example of the right-hand sidebar 111400 (cf. FIG. 1l) with the second tab (adding 2D and 3D objects) being active. It can be viewed and used with a computing device 400106. Additionally or alternatively in this example, the button labeled 111402 provides search functionality for 2D icons and 3D objects based on external search application program interfaces (APIs). In some examples, adding custom content 111404 may include (from left to right) functionality for adding, for example but not limited to, 2D text, 3D text, 2D objects and/or 3D objects in any combination. Selection of either a 2D or 3D object may activate modal dialogs implementing the drag-and-drop functionality which may be required by the methods and systems here. Custom 2D and 3D objects can also be imported by user interface interaction such as but not limited to dragging and dropping them into the main editor area 111206. The areas labeled 111406, 111408, 111410 and 111412 may provide predefined selections of 2D and/or 3D objects that can be directly added to the currently active scene by clicking, tapping, or other interaction. 111406 provides a set of 3D arrows; 111408 a set of 3D shapes (such as but not limited to any combination of info box, box with question marks, coin, various crosses, cube, cylinder, various diamonds, crescent, hexagon, prism, pyramid, partial pyramid, refresh symbol, various roofs, partial roof, shamrock, sphere, various stars, stop symbol, trapezoid, or other shape); 111410 a set of 2D (square, rectangle, circle, or other 2D shape) and 3D shapes (cube, sphere, cone, tube, or other 3D shape) to which custom textures can be applied. A custom texture may be any kind of 2D image that is applied to the virtual surface of the 2D or 3D object shapes; 111412 a set of 3D tools (various items such as but not limited to any combination of screwdrivers, wrench, hammer, drill, box spanner or other); and 111414 a set of 2D icons (such as but not limited to any combination of bus, cash register, clothes, fork and knife, exit, journal, map, parking, restroom, store, theta, hiking trail, viewpoint, warning sign, speaker, ear, headphones, gramophone, music note, sound off, soundwave or other). The examples of shapes, tools, and icons here are not intended to be limiting, and for different use cases, could be customized to aid the users of the tools. The examples of construction tools are not intended to be limiting and could be customized as well.

FIG. 1o shows an example right-hand sidebar (cf. FIG. 1l) 111500 with the third tab (adding/editing animations) being active. Additionally or alternatively, the example can be viewed and used with a computing device 400106. A set of predefined animations may be presented 111504, which can be directly applied to the currently selected 2D or 3D object by clicking the corresponding tile. The example animations shown here are not intended to be limiting, but include rotating the object, moving the object in a circle, moving the object forward/sideways, pirouette, spinning the object forward/sideways, letting the object pulsate and letting the object bounce up and down. Any kind of object animation could be offered in this example, and used to animate a selected object. Moreover, in some examples, there is the possibility to create custom animations 111502, which may be composed of a sequence of rotation, translation and scaling animations.

FIG. 1p shows an example right-hand sidebar (cf. FIG. 1l) 111600 with the fourth tab (adding/editing actions) being active. It can be viewed and used with a computing device 400106. The sidebar shows example available triggerable commands 111604 which here include any combination of calling a number, opening a web page, transferring to a different scene (named "Open slide" in the figure), showing an info box containing a text, showing a warning box containing a text, sending an e-mail, starting or ending an object animation, displaying or removing an object, and playing a sound. Any kind of triggerable command could be used; these examples are not limiting. After having selected a 2D or 3D object in the current scene, the user 110102 can select one of these actions and set the according parameters, e.g., specifying a URL to be opened or choosing a sound file to be played, and the action is then associated with the target object. In some embodiments, each object may be associated with one action at a time that may be triggered when the object is clicked in the system's player mode 110230 or otherwise interacted with by a user. If the currently selected object is already associated with a triggerable action, it may be overwritten with the new action, or with new parameters if the new action is the same as the existing one (which effectively means editing the existing action). Additionally or alternatively, the existing action can be deleted by choosing the according option 111602. Additionally, a set of OPC UA commands 111606 may be available, which may enable the display of information communicated by a machine to the systems and methods here. For this, an OPC UA server may be specified in the Holo settings. Information available from machines via this server can then be made available in AR and VR scenes. This may enable a number of use cases. To give only one non-limiting example, the temperature of a machine could be displayed relative to a specific part of the machine in an AR scenario.
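As one non-limiting illustration of the per-object action association described above, the following TypeScript sketch shows one way an editor could keep at most one action per object, editing the parameters when the new action has the same type as the existing one and replacing it otherwise. The type and function names (TriggerableAction, setAction and so on) are assumptions for this sketch and not part of the systems described here.

// Minimal sketch of per-object action association (hypothetical names).
type ActionType =
  | "call" | "openUrl" | "openSlide" | "infoBox" | "warningBox"
  | "sendEmail" | "toggleAnimation" | "toggleObject" | "playSound";

interface TriggerableAction {
  type: ActionType;
  params: Record<string, string>; // e.g. { url: "..." } or { soundFile: "..." }
}

interface SceneObject {
  id: string;
  action?: TriggerableAction; // at most one action per object
}

// Assign or edit the action of the currently selected object.
function setAction(obj: SceneObject, next: TriggerableAction): void {
  if (obj.action && obj.action.type === next.type) {
    // Same action type: only the parameters are updated (editing the existing action).
    obj.action.params = { ...obj.action.params, ...next.params };
  } else {
    // Different or missing action: the new action replaces the old one.
    obj.action = next;
  }
}

// Remove the action, corresponding to the delete option in the sidebar.
function deleteAction(obj: SceneObject): void {
  delete obj.action;
}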

FIG. 1q depicts one possible implementation of a user interface 111700 for preprocessing an imported 360° spherical image before the corresponding VR scene is initialized. The imported 360° spherical image can be viewed and used with a computing device 400106. Systems and methods here may provide predefined filters 111704, particularly for enabling the user 110102 to automatically enhance images. Some filters can be customized for specific devices; for example, the RICOH THETA spherical camera is prone to reduced image quality, particularly in low-light settings. That is, when activating the "Theta filter" in the systems here, the quality of a 360° spherical image captured with the corresponding camera may automatically be improved based on predefined heuristics. 360° spherical images can be created using a spherical or panorama camera 390102 or using rigs of multiple cameras 390200. Besides the predefined filters, the user 110102 also has the option to manually fine-tune contrast, brightness, and color vibrancy 111706 of the image. A live preview may be shown at all times 111702 in certain example user interfaces.
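The manual fine-tuning of contrast, brightness and color vibrancy could, purely as an illustration, be sketched with a browser 2D canvas as below. The function name adjustImage, the parameter ranges and the simple per-pixel formulas are assumptions made for this sketch and do not represent the actual filters or heuristics of the systems here.

// Minimal sketch of brightness/contrast/vibrancy adjustment using a 2D canvas.
function adjustImage(
  img: HTMLImageElement,
  brightness = 0,   // -255..255, added to each channel
  contrast = 1,     // 0..2, scales the distance from mid-grey
  vibrancy = 1      // 0..2, scales the distance from the per-pixel grey value
): HTMLCanvasElement {
  const canvas = document.createElement("canvas");
  canvas.width = img.width;
  canvas.height = img.height;
  const ctx = canvas.getContext("2d")!;
  ctx.drawImage(img, 0, 0);

  const data = ctx.getImageData(0, 0, canvas.width, canvas.height);
  const px = data.data;
  for (let i = 0; i < px.length; i += 4) {
    const grey = (px[i] + px[i + 1] + px[i + 2]) / 3;
    for (let c = 0; c < 3; c++) {
      let v = px[i + c];
      v = grey + (v - grey) * vibrancy;          // saturation-like vibrancy
      v = 128 + (v - 128) * contrast;            // contrast around mid-grey
      v = v + brightness;                        // brightness offset
      px[i + c] = Math.max(0, Math.min(255, v)); // clamp to the valid range
    }
  }
  ctx.putImageData(data, 0, 0);
  return canvas; // can be shown as the live preview
}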

It should be noted that 360° spherical cameras or arrangements of multiple cameras which use software to stitch images together to form a 360° image are used in this disclosure to discuss systems that create images which allow a user to pan in any direction: left, right, up, down, or a combination of any of these. The general goal of such a 360° image is to immerse the user in the sights of where the 360° image was taken. For example, a 360° image is taken on a beach. A user may later experience the same scene where the 360° image was taken by viewing it on a two-dimensional screen or through a viewing apparatus such as a 3D goggle system. Such arrangements may have motion detection or allow for navigation of the 360° image by mouse, keyboard, or other arrangement which may allow a user to turn in any direction to view the image. In goggle arrangements, the headset may be synchronized to the image such that the user's movements are detected, and the image changes correspondingly. These arrangements and uses of 360° images may be known as virtual reality. Similarly, if a camera is used to capture the user's actual environment and then the systems here are used to augment the camera images with computer objects or overlays or other constructs, the user may experience their actual environment, but with added computer imagery. Such arrangements may be known as augmented reality. The terms virtual reality and augmented reality are not intended to be limiting, and the systems and methods described here may be used to create either, or both. The terms may be used interchangeably in places and are not intended to be limiting in such a way.

FIG. 1r depicts one non-limiting possible implementation of a user interface 111800 for preprocessing an imported 3D model before it is added to an AR or VR scene. The example can be viewed and used with a computing device 400106. In this example the system provides tools that, first, allow a change to the intensity of the lighting 111808. Second, through the tools presented, the user 110102 can redefine which side should be the top side of the 3D model 111810, which makes it possible to, e.g., flip the model upside down before importing. Third, if the shadow option 111812 is activated, a different material enabling more advanced lighting and shadows may be applied to the 3D model. In some example embodiments, the standard material may be matte. Finally, the user 110102 is also provided tools to reduce the complexity of the 3D model in terms of vertices and triangles 111814, based on a version of the Stan Melax Progressive Mesh type Polygon Reduction Algorithm specifically adjusted for web-based processing of 3D models. A live preview of the model considering the currently selected options 111802, as well as the current number of vertices in the model 111804, may be shown at any time in a preview scene. In the example, whether that preview scene is automatically rotating or not can be changed using the button labeled 111806.

FIG. 1s depicts one non-limiting possible implementation of a user interface 111900 for creating a custom animation. It can be viewed and used with a computing device 400106. In the bottom part of the user interface, a tool is presented 111902, which may be used to define a custom animation in terms of a sequence of individual rotation, scaling and positioning animations. The user 110102 may set frame points for any of the individual animations on the timeline 111902 and then specify the rotation, size or position of the selected object at that point in time by direct manipulation within the main editor area 111904.
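A minimal sketch of such a keyframe timeline is given below in TypeScript. The Keyframe structure, the field names and the linear interpolation are illustrative assumptions rather than the actual animation model of the editor.

// Minimal sketch of a keyframe timeline for custom animations (hypothetical structure).
interface Keyframe {
  time: number;                         // seconds on the timeline
  position: [number, number, number];
  rotation: [number, number, number];   // Euler angles in radians
  scale: [number, number, number];
}

// Linearly interpolate the object's transform between the surrounding keyframes.
function sample(frames: Keyframe[], t: number): Keyframe {
  const sorted = [...frames].sort((a, b) => a.time - b.time);
  if (t <= sorted[0].time) return sorted[0];
  if (t >= sorted[sorted.length - 1].time) return sorted[sorted.length - 1];
  const i = sorted.findIndex(f => f.time > t);
  const a = sorted[i - 1], b = sorted[i];
  const k = (t - a.time) / (b.time - a.time);
  const lerp3 = (u: number[], v: number[]) =>
    [0, 1, 2].map(j => u[j] + (v[j] - u[j]) * k) as [number, number, number];
  return {
    time: t,
    position: lerp3(a.position, b.position),
    rotation: lerp3(a.rotation, b.rotation),
    scale: lerp3(a.scale, b.scale),
  };
}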

FIG. 1t depicts one non-limiting possible implementation of an interface 112000 for the player mode of the systems and methods here, which, additionally or alternatively, enables the consumption of previously created AR and/or VR content using a computing device 400106. In the example, to the left, the user 110102 is presented with a collapsible list of the AR and/or VR scenes (referred to as "slides" in this case) 112002 contained in the Holo that is being utilized. The AR or VR content of the currently viewed scene is presented in the main area of the player 112004. At the bottom of the example, a set of controls 112006 is presented that may enable interaction with the three-dimensional scene in terms of panning the field of view to the left/right and top/bottom, zooming in and out, switching to full-screen mode and sharing the currently viewed Holo with other users. In this example, the button labeled "EDIT" 112008 enables the user 110102 to return to the editor mode 110232 of the system, but may only be displayed in case the Holo is actually owned by the user according to the Holo 110128 and user management 110126 data stored on the server side 110116.

The user interfaces illustrated in FIG. 1k-1t represent only example implementations of the methods and systems here and are not intended to be limiting. Any combination of the user interfaces shown in FIG. 1k-1t may be used interchangeably and in any combination or order. Additionally or alternatively, any combination of the above elements may be used. In particular, it is possible to provide implementations thereof that include different elements, e.g., different sets of predefined 2D and 3D objects. These predefined objects may be for specific use cases such as the construction industry or other industries. Moreover, while the interfaces described above were designed for desktop computers 400122, the methods and systems here could be transferred into the context of devices with different input methods. These could include head-mounted displays 400200 400300 that require hands-free interaction (e.g., pointing a crosshair by moving your head) or touch devices such as smartphones and tablet computers 400102. The latest generation of all of these device classes is powerful enough to display 3D content without judder. In addition, the vast majority are based on Android or iOS as the operating system, which means that web browsers such as the mobile versions of Chrome or Safari are available, which enables the web-based creation and consumption of AR and VR content. Any kind of future operating system, browser or internet access arrangement could be supported.

Other Feature Examples

In certain example embodiments, additionally or alternatively with the examples described above, while it is already possible to add sound to 2D/3D objects in terms of a triggerable action (i.e., the sound plays when the object is clicked), the systems may be extended with ambient sound, i.e., sound that is not bound to a specific object, but automatically starts playing when the user enters the AR/VR scene. In another example, additionally or alternatively, invisible shapes may be introduced that, e.g., can be placed in front of a certain feature embedded in a 360° spherical image. For instance, if a door is visible on a 360° spherical image taken with a spherical camera device 270101, which is described in detail in FIG. 29a, this would make it possible to effectively make that door clickable rather than having to insert a visible 2D/3D object (like a virtual door in front of the photographed one) to realize the click interaction. One non-limiting use case for this may be VR point-and-click adventure games. Additionally or alternatively, painting functionality may be provided, which means that using a brush or pencil drawing tool the user will be able to annotate features directly on the 360° spherical texture in VR scenes. Additionally or alternatively, live streams may be used which may be filmed with a 360° spherical camera 270101 in VR scenes, which can then be annotated with 2D and 3D objects and consumed by multiple users in real time. These non-limiting examples could be combined with any examples listed throughout this disclosure and could be augmented or used alone with other features.

Scene Creation and Timeline Examples

FIG. 2a to FIG. 2y give an example overview of the system providing a method of timeline-based functionality, additionally or alternatively, which includes the functionality of automatic creation of timestamps and chronological structuring of 360° images and video, as well as rearranging, adding and deleting of timeline-elements by the user. Thus, in the example, the system fulfills the need to have chronologically structured image or video data visualized in a meaningful manner. The basis for the depiction of changes in the user-selected geographical location over a period of time are images and/or videos covering a field of view of up to 360°, including but not limited to various commercial and non-commercial usage scenarios such as but not limited to construction sites, natural environments, shops, restaurants, offices, museums, parks as well as unrelated personal pictures and the like. In the context of this method, 360° images and video include, but are not limited to, full-spherical images and video with a field of view up to 360°. The 360° images and video can be supplied by hardware systems including, but not limited to, the devices depicted in FIG. 29a, FIG. 29b, FIG. 29c, FIG. 30a. When added to the system, a 360° image or video can be presented as a Holo that can be further enriched with 2D and/or 3D elements. The resulting Holo can be viewed with, but is not limited to, the hardware systems depicted in FIG. 30a, FIG. 30b, FIG. 30c, FIG. 30d, FIG. 30e, FIG. 30f.

FIG. 2a shows an example of how different users 120110, 120118 could be using the proposed systems and/or methods, as it describes two distinct, non-limiting use case examples. The user 120110 could either work with an existing system 120120, which uses the systems here 120104 and its interfaces to create and output content, and/or the user 120110 could directly interact with the proposed system 120104. Either way, the user 120110 may provide the data 120106, which he wants to work with, to an existing system 120120 or the proposed system 120104. The user 120110, as well as the data 120106 provided, interact over well-specified interfaces 120108 with either system. The data 120106 may be gathered by hardware systems including but not limited to devices as depicted in FIG. 29a, FIG. 29b, FIG. 29c, FIG. 30a. Both interfaces 120108 may meet the requirements that the user's 120110 needs and the system impose, for example a web browser or smartphone and the like. If the user 120110, 120118 interacts with an existing system 120120, that system can use the system's interface 120122 to make use of the capabilities it provides. As the system 120104 is able to edit as well as view content 120116, in certain example embodiments, an existing system 120120 may provide either a player/viewer 120114 module or an editor 120112, or other module 120102 as part of itself. Either system 120120, 120104 may output content 120116 to a user 120118 which another user 120110 produced. The viewing user 120118 can be an arbitrary user with whom the editing user 120110 shared the content 120116, or the user 120110 himself. In any case, an interface 120124 may provide a way in which the viewing user 120118 is able to consume content 120116 on hardware systems including but not limited to devices depicted in FIG. 30a, FIG. 30b, FIG. 30c, FIG. 30d, FIG. 30e, FIG. 30f.

FIG. 2b shows an example, additionally or alternatively, of a high-level interaction diagram of how a user 120110 may interact either with the proposed system 120104 as a standalone application, or with the systems here 120104 acting as an extension to an existing system 120120. The user 120110 may use one of the computer interfaces 120108 to interact with the underlying system to gain access to his content (managed as project entities called Holos 120212). In this example, a Holo 120212 comprises visual data 120106 created by hardware systems including but not limited to devices as depicted in FIG. 29a, FIG. 29b, FIG. 29c, FIG. 30a. This underlying system can be either the proposed system 120104 or another existing system 120120. If it is the latter, the existing system 120120 uses the interface 120122 to the systems 120104 to pass stored Holos from the external storage 120210 to the systems 120104 via its API. Otherwise, the system 120104 uses interface 120122 internally. Thus, in the example, the user 120110 can now either start a new Holo 120212 from scratch, or load an existing Holo from an external storage 120210. After the system has created a new Holo 120212 for the user 120110 by requesting data such as name and description from him, or loaded an existing Holo from external storage 120210 or internal storage 120208, the user 120110 may be able to create a new 360° timeline 120202, or edit an existing 360° timeline 120202. The user can add various 2D/3D content 120204 to the new/existing 360° timeline 120202 through an integrated library or methods including but not limited to hardware-related systems as depicted in FIG. 29d. At any time in between these steps 120202 120204, the user 120110 may be free to preview his creation using the player 120206, save his Holo to either internal 120208 or external storage 120210, or continue to add/edit 360° timelines 120202 and add content 120204.

FIG. 2c represents an example of the system's data structure, showing how the system represents a Holo 120212 and giving an example of how it allows a user to map locations to various points in time. The systems described here may manage assets such as but not limited to 360° images and videos 120308, represented as OpenGL/WebGL rendered scenes, a scene 120304. A scene in this example may have numerous attributes such as name and description, but could have others as well or in combination. Some aspects include data storage for the user's 120110 data 120106 as well as a timestamp 120310 at which the image or video data has been captured. The feature that allows a scene 120304 to have subscenes 120304 enables the system to give various points in time a common parent scene 120304, which represents the location, while its children may denote the various points in time. Scenes 120304 may be saved in a Holo 120212 data structure called Holo 120302. To give an example, a Holo 120302 may be able to perfectly represent 120312 a construction site. While various locations (here for example, rooms 120314) correspond to top-level scenes 120304, the location's change over time 120316 corresponds to the top-level scene's 120304 children 120306.
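For illustration, the parent/child scene structure described above could be sketched as the following TypeScript interfaces. The field names (media, timestamp, subScenes and so on) are assumptions made for this sketch, not the actual data structure of the systems here.

// Minimal sketch of the Holo/scene structure: a top-level scene represents a location,
// and its sub-scenes represent the same location at later points in time.
interface Scene {
  id: string;
  name: string;
  description?: string;
  media: string;          // URL or data-URL of the 360° image or video
  timestamp?: Date;       // capture time of the image or video data
  subScenes: Scene[];     // children denoting the same location at later points in time
}

interface Holo {
  id: string;
  name: string;
  description?: string;
  scenes: Scene[];        // top-level scenes, one per location (e.g. rooms of a construction site)
}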

FIG. 2d shows an example detailed view of how 360° images and videos are processed and presented to the users. The 360° images and video files 120402 may be captured with hardware systems including but not limited to devices as depicted in FIG. 29a, FIG. 29b, FIG. 29c, FIG. 30a. After the user 120110 or an embedding system 120120 passes image or video data 120402 to the system 120104, it may generate data-urls 120404 for both image and video data 120402 in order to make them easily embeddable into the desired (web) page. If the data is neither an image nor a video file, the system 120104 may cancel the import and send an appropriate response to the user 120110. In certain examples, as soon as the data-urls 120404 are created, the system 120104 may extract meta data 120406 such as the capture time and other relevant information from the provided data 120402. This information may be used to generate the correct order for time-based scenes. In the next step 120408, a dialog 121202 presents various image and video preprocessing options 121206 121208 to the user 120110. In certain examples, he can select the desired options 121210 and apply them with an interaction such as but not limited to a click on the 'Add' button 121212. This may start the process 120410 to apply the settings to the raw image or video data. To provide the user with the best possible experience and reduce process overhead, additional modifications may be applied if 120412 the files to import are 360° image files. If the data is a 360° video, a process may be started 120426 to load it from its data-URL and apply it as a texture to a standard high-resolution 360° sphere object 120432 that the user sees. If 120412 the data is a 360° image, the size and proportions may be used as well as custom algorithms to perform adjustments to the 360° image orientation 120414. By adjusting the image's orientation, a more realistic and true-to-life image may be created. Next, in some embodiments, alternatively or additionally, a process may be applied 120416 which may be referred to as image slicing.
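One possible sketch of the type check, data-URL generation and capture-time extraction step is shown below using standard browser APIs. Reading the true capture time would normally require an EXIF/XMP parser, so the file's lastModified date is used here only as a stand-in; the function name importMedia is hypothetical.

// Minimal sketch of the import step: type check, data-URL generation and a capture-time fallback.
async function importMedia(file: File): Promise<{ dataUrl: string; capturedAt: Date }> {
  if (!file.type.startsWith("image/") && !file.type.startsWith("video/")) {
    // Neither an image nor a video file: cancel the import.
    throw new Error("Unsupported file type: " + file.type);
  }
  const dataUrl = await new Promise<string>((resolve, reject) => {
    const reader = new FileReader();
    reader.onload = () => resolve(reader.result as string);
    reader.onerror = () => reject(reader.error);
    reader.readAsDataURL(file); // embeddable data-URL for the (web) page
  });
  // Stand-in for real capture-time metadata extraction.
  return { dataUrl, capturedAt: new Date(file.lastModified) };
}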

In order to improve load times and increase performance of displaying images, and because it may be beneficial to show the user 120110 the 360° image as soon as possible, in order to improve the experience and reduce waiting time while looking at a progress bar indicating images are loading, the whole 360° image may be divided into multiple smaller parts. These smaller parts can be loaded independently in whichever order the system determines; thus it is possible to display 120428 particular parts immediately while the rest are still loading. In order to keep the image quality at a high level, each slice may have a dimension of 2048 by 2048 pixels, for example. In the process of slicing the image, a lower resolution thumbnail 120420 may be created for immediate display 120428 to the users. In such an example, the system may display a slightly blurred image of the final sphere. A process 120424 may be started to assemble all slices into the final sphere and dispose of the low-resolution thumbnails to save memory on the users' device. The process 120426 may then be started which loads and applies all high-resolution slices and passes the sphere to the renderer. The whole import in this example finishes by displaying the high-resolution 360° sphere 120430 to the user on hardware systems including but not limited to devices depicted in FIG. 30a, FIG. 30b, FIG. 30c, FIG. 30d, FIG. 30e, FIG. 30f. In some embodiments, only the sphere slices that are viewed by a user are fully downloaded and displayed; those that are not viewed may not be loaded.
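The slicing step could, purely as an illustration, be sketched with a 2D canvas as follows. The tile size of 2048 by 2048 pixels follows the example above, while the thumbnail size, the JPEG qualities and the function name sliceImage are assumptions.

// Minimal sketch of slicing an equirectangular 360° image into 2048x2048 tiles plus a
// low-resolution thumbnail that can be shown while the full-resolution tiles are loading.
const TILE = 2048;

function sliceImage(img: HTMLImageElement): { thumbnail: string; tiles: { x: number; y: number; url: string }[] } {
  const canvas = document.createElement("canvas");
  const ctx = canvas.getContext("2d")!;

  // Low-resolution thumbnail for immediate (slightly blurred) display on the sphere.
  canvas.width = 512;
  canvas.height = 256;
  ctx.drawImage(img, 0, 0, canvas.width, canvas.height);
  const thumbnail = canvas.toDataURL("image/jpeg", 0.6);

  // Full-resolution tiles, each of which can be loaded and applied independently.
  const tiles: { x: number; y: number; url: string }[] = [];
  for (let y = 0; y < img.height; y += TILE) {
    for (let x = 0; x < img.width; x += TILE) {
      const w = Math.min(TILE, img.width - x);
      const h = Math.min(TILE, img.height - y);
      canvas.width = w;   // resizing the canvas also clears it
      canvas.height = h;
      ctx.drawImage(img, x, y, w, h, 0, 0, w, h);
      tiles.push({ x, y, url: canvas.toDataURL("image/jpeg", 0.9) });
    }
  }
  return { thumbnail, tiles };
}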

FIG. 2e depicts an example process for creating and editing a 360° timeline. The 360° images and video files 120402 may be captured with hardware systems including but not limited to devices as depicted in FIG. 29a, FIG. 29b, FIG. 29c, FIG. 30a. Once the data 120106 120402 is imported using the process described, for example, by FIG. 2d 120502, the system 120104 checks whether the date was extracted from the data's metadata. If the date metadata was extracted, the system 120104 may save the date into the scene associated with the currently processed data 120402. In some example embodiments, if a user 120110 added additional content, such as 2D/3D objects, to a time-based scene, he may want to transfer this data to the following time-based scene. Hence, the system 120104 checks whether there is content available in the previous scene, and if so, clones and/or copies the content and adds it to the new time-based scene representing the current data. Either way, the content for a new scene may be used to create a view 120508 in the user interface for it. This representation in the user interface may make up the items (scenes) represented in a timeline. The timeline 121002 may order this representation correctly to put it at the correct position 120510 in the timeline in relation to the already present data. If a series of image or video data is added, the first image may be displayed to the user 120110 120118 in order to improve the user experience. The system 120104 may use the renderer 120428 to update and display the first low-resolution location-based scene to the user on hardware systems including but not limited to devices depicted in FIG. 30a, FIG. 30b, FIG. 30c, FIG. 30d, FIG. 30e, FIG. 30f. Once all the data has been processed, all dates and the order 120512 of the images and videos may be validated. If they are valid, the 360° timeline creation process is complete 120516; otherwise the user may get the choice 120514 to adjust, rearrange and edit the whole timeline manually.
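The chronological ordering and validation of time-based scenes could be sketched as below, reusing the Scene interface sketched earlier. Treating a missing or unreadable timestamp as invalid (and sorting such scenes to the end, so the user can rearrange them manually) is an assumption made for illustration.

// Minimal sketch of ordering time-based scenes by capture date and flagging whether
// manual rearrangement is needed.
function orderTimeline(scenes: Scene[]): { ordered: Scene[]; valid: boolean } {
  const valid = scenes.every(s => s.timestamp instanceof Date && !isNaN(s.timestamp.getTime()));
  const ordered = [...scenes].sort((a, b) => {
    const ta = a.timestamp ? a.timestamp.getTime() : Number.MAX_SAFE_INTEGER;
    const tb = b.timestamp ? b.timestamp.getTime() : Number.MAX_SAFE_INTEGER;
    return ta - tb; // ascending: earliest capture first
  });
  return { ordered, valid }; // valid === false -> offer manual adjustment of the timeline
}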

FIG. 2f details example steps for the user 120110, as well as the system 120104, to add new data 120402, created with hardware systems including but not limited to devices as depicted in FIG. 29a, FIG. 29b, FIG. 29c, FIG. 30a, to an existing timeline 121002, which represents the time expansion of a location-based scene. In the example, to add a new time-based scene to a timeline 121002, the users can use several different methods 120602. In either case, the example system 120104 may present a dialog 121202. If the data has been added, for example, dragged onto the timeline 121002 directly in the UI, the preview panel 121204 may be populated 120606 with a thumbnail of the data. Otherwise, users may interact with, for example drag 120605, the data into the panel 121204 beforehand in the example UI. Panel 121204 also offers the user some options 121210, which can be applied 120408 to change the data's appearance (custom filters among others). In certain embodiments, the system 120104 may set 120608 these options and save them into the scene's data storage and, if necessary, will copy existing content 120506 to the new scene. If the option to auto set date/time 1206 has been checked, the system 120104 will start a process 120504 to set this date/time according to the meta data in certain examples. After the system has set the date/time in this example, the user 120110 may see a new dialog 121502. Either the user may accept and apply the order which the system has determined by clicking 'Apply' 121518, or he can set the new time-based scene to a new date/time, and therefore position in the timeline 121002, using a different process 120514. If he does the former, the system 120104 adds the new element at the specified position according to the set date and time 120510.

FIG. 2g shows an example process of adding external content to a scene comprising 360° images or video. The image and video data can be created with hardware systems including but not limited to devices as depicted in FIG. 29a, FIG. 29b, FIG. 29c, FIG. 30a. In the example, when the user 120110 selects a scene he will see a new panel 120914 in the UI. An embedding system 120120 can use an interface 120122 to pass 2D/3D models in various data formats to system 120104, which generates a representation for it in this panel 120914. If no data is present, the user 120110 can select data through a process 120706 from his own storage and let the system 120104 import this data for him. If data is already available or has been imported, the user can select this data 120702 from the panel 120914 to pass to the process 120706 which, in the example, will add the selected object to the current scene. Furthermore, the user can edit the objects via another workflow denoted by another process 120704.

FIG. 2h gives example details to make the data flow inside the systems and/or methods 120104, as well as the data flow between an embedding system 120120, more transparent. The used data can be created by hardware systems including but not limited to devices as depicted in FIG. 29a, FIG. 29b, FIG. 29c, FIG. 30a. The data 120106 is passed 120801 either into the existing system 120120 and then to the proposed system 120104 via the interface 120122, or directly 120803 to the system's 120104 UI 120805, which accepts the data and passes it on to the various loaders 120804 which handle the loading and import. After that, alternatively or additionally in certain examples, the image or video preprocessing units 120806 take care of applying the filters and custom options a user 120110 has set. The processed data 120106 gets passed on to create internally used data structures like scenes 120808 or Holos 120812. In certain examples, there may be more 120810 to the systems than these two, but they are mentioned as they represent the information and data most users 120110 120118 will work with. Now that the data is ready to be presented to the user inside the system 120104, interfaces to WebGL/OpenGL 120814 may be used to render 120816 either the system's canvas 120818 or an external canvas 120820, which can be achieved by using the interface 120122 provided in FIG. 2a.

FIG. 2i illustrates an example combined user interface (UI) of the timeline system 120104 and the existing system 120120 as a VR editor for creating and editing VR tours with 360° images and videos 120902, with closed timeline panel. While this particular UI has been customized for a web-based application, the systems can be used on various devices, like but not limited to computing devices, tablets, smartphones and smartglasses. Depending on the device, the usage and thus the UI will vary. Therefore, this and the following UI diagrams are to be understood as one of numerous design possibilities and non-limiting. The image and video files used as exemplary data for the exemplary UI illustrations in the following can be created by hardware systems including but not limited to devices as depicted in FIG. 29a, FIG. 29b, FIG. 29c, FIG. 30a. The output hardware system can be but is not limited to the devices depicted in FIG. 30a, FIG. 30b, FIG. 30c, FIG. 30d, FIG. 30e, FIG. 30f. In the examples, the user 120110 may be able to add and edit 2D as well as 3D content 120914 in the uploaded 360° image or video through the UI. Newly added time-based scenes 121004 121006 121008 can inherit 2D and 3D content 120914 of their preceding location-based scene 120906 120908 or time-based scene 121004 121006 121008. The user 120110 may add at least one location-based scene 120906 120908 to start a new timeline for the depicted location. Each location-based scene may be listed in the scene overview 120916 as a scene preview. To create a virtual reality tour for a specified area, the user 120110 may be able to add multiple locations 120912 as location-based scenes 120906 120908. Each location represented as a location-based scene 120906 120908 can hold its own timeline. In some examples, the user 120110 can add a timeline-element to every location-based scene 120908 without any existing time-based scenes with the according 'Add scene' button 120910 in the UI. The current date and time 120904, as well as a specified name 120904, for either location-based as well as time-based scenes may be displayed at any time (with expanded as well as closed timeline panel) in the working area 120900 of the editor UI. To expand the timeline panel for a location-based scene with a timeline, the user 120110 may click on the location-based scene (preview) 120906.

FIG. 2j shows a possible visualization of the expanded timeline panel 121002 of the timeline system 120104 in an exemplary embedding system 120120 as introduced in FIG. 2i. A timeline for a specific location-based scene 120906 may hold at least two time-based scenes as timeline-elements 121004, 121006, 121008, with one of the scenes being the location-based scene 121004 marking time t0. Each timeline-element 121004, 121006, 121008 may have a date and time, and the elements are sorted chronologically by date in ascending order. The user 120110 can change between the created time-based scenes 121004, 121006, 121008 by clicking on the according timeline-element 121004, 121006, 121008. The caption 120904 on top of the 360° image or video displays the date and name of the currently selected location-based 120906 or time-based scene 121004, 121006, 121008. To gain more working space or have a better view of the 360° image or video 120902, the user 120110 can hide 121010 the timeline panel 121002.

FIG. 2k illustrates one possible implementation example of the user interface to add new time-based scenes 121004, 121006, 121008 to an existing timeline 121002 of a location-based scene 120906. New time-based scenes 121004, 121006, 121008 can be added by the user 120110 at either a pre-selected area between two existing timeline-elements 121102 or at the end of the timeline 121104. A 360° image or video can either be dragged and dropped at the specified areas 121102, 121104 or be selected through a common browsing function.

FIG. 2l to FIG. 2n represent example modals 121202 for configuring predefined settings of a new time-based scene in an existing timeline 121002, which open when adding 121102, 121104 further timeline-elements. The user 120110 can upload 121204 360° images and videos either by browsing, as shown in FIG. 2l, or by drag and drop, as depicted in FIG. 2m. If the user has already chosen a 360° image or video, a preview of the selected image or video may be displayed 121402, as illustrated in FIG. 2n. An uploaded 360° image 121302 or video can be auto-aligned 121206 in the timeline 121002 by the timeline system 120104 or manually by the user 120110 himself. When creating a new timeline-element, the 2D and 3D content 120914 of the preceding location-based scene 120906 or time-based scene 121004, 121006, 121008 can be cloned 121208 and inserted into the newly added time-based scene. To level the varying quality of the 360° image and video material taken over a period of time, the user can adjust the lighting and contrast with predefined image and video settings 121210.

FIG. 2o to FIG. 2q illustrate example modals 121502 to (chronologically) insert newly added 121508 and reorder 121702 existing time-based scenes 121510, 121512 in the timeline 121002, 121504 of a location-based scene 120906, 121506. Each scene 121506, 121508, 121510, 121512 in the timeline 121504 has a scene preview, name, and date and time as defined in the scene settings, which the user 120110 can edit 121520, as further described in FIG. 2s. When a time-based scene 121508, 121510, 121512 is added with auto-set date and time, or the user 120110 added the new timeline-element at a certain time 121102, the timeline system 120104 will automatically insert the scene between the existing scenes 121506, 121510, 121512 of a timeline 121002 in the timeline overview 121504. Alternatively, the user 120110 can autonomously or automatically set the date 121514 with any calendar or times such as month, day and year, and the time 121516 with hours, minutes and seconds. A new time-based scene 121602 may be inserted at the end of the timeline 121504 if the scene is not added 121104 at a certain time in the timeline 121002. Alternatively or additionally, a user 120110 can reorder newly added as well as existing time-based scenes 121510, 121512, 121508, 121602 in the timeline 121504 subsequent to the location-based scene 121506 at any time by drag and drop 121702. Any changes to the order of the timeline 121504 and date/time 121514, 121516 of a newly added scene, as well as an existing time-based scene 121508, 121602, may be confirmed by the user 120110 in order for them to be applied to the timeline panel 121002.

FIG. 2r depicts an example showing the Holo from FIG. 2k through FIG. 2q with the newly created timeline-element 121802 chronologically inserted into the timeline 121002 of the exemplary location-based scene 120906 at a subsequent time. The new time-based scene may show a 360° image or video of the location represented in the location-based scene 120906 at a later point in time. In some examples, the system may clone 2D and 3D content of the preceding scene to the new time-based scene 121802 and place it at the same positions or nearly the same positions in the new 360° image or video 121804. Additionally or alternatively, the caption 120904 at the top of the 360° image or video of the scene may show the corresponding date and time as well as the name of the selected scene. By clicking on the selected timeline-element in the timeline panel 121002, the user 120110 may open the modal 121502 for ordering the time-based scenes 121802, 121006, 121008 of a timeline 121002, as well as the modal 121902 for editing the settings of the existing time-based scenes 121802, 121006, 121008 as well as the location-based scene 121004.

FIG. 2s illustrates an example modal 121902 for editing the scene settings 121904, 121906, 121908, 121910 of time-based scenes 121802, 121006, 121008 of an existing timeline 121002 to a location-based scene 121004. In such an example, the user 120110 can name 121904 the current time-based scene, change the date and time 121906, give a description 121908 of the scene to add any of various things including but not limited to, for example, personal notes, and/or a thumbnail 121910 as scene preview. When changing the date and time 121906 of a time-based scene 121802, 121006, 121008, the scenes of the timeline 121002 may be automatically reordered chronologically with ascending date and time. In some examples, the user can also rearrange the scenes by changing the scene order modal 121912, as described in FIG. 2o to FIG. 2q. The user 120110 may save the settings, after he has changed them, by indicating such as by clicking the 'Save' button 121914.

FIG. 2t illustrates one possible implementation of a combined UI 122000 of the timeline system 120104 and the existing system 120120 as a VR player for viewing VR tours with 360° images and videos 122002, with closed timeline panel 122104. The timeline 122004 in the viewer 122000 is expandable by user interaction to navigate through the time-based scenes. In the example, the viewer 120118 can navigate between locations with a separate navigation panel 122006, 122008 with a dropdown menu 122006 or other selection setup, listing all existing locations, and an option 122008 for changing to the previous or following location-based scene. In the example, the dropdown menu 122006 enables the user 120118 to select a specific location of the existing location-based scenes directly. The user 120118 can interact with the scene 122002 through given controls 122010, 122012, 122014.

FIG. 2u and FIG. 2v depict example embodiments of an expanded timeline 122102 as a visualization of the timeline system 120104 through which the viewer 120118 can follow the changes in the 360° images or videos 122002, 122202 of a certain location over a period of time. The timeline example 122102 has one location-based scene 122104 and at least one time-based scene 122106, 122108, 122110, 122112. The viewer 120118 can open another time-based scene 122106, 122108, 122110, 122112 by selecting the appropriate timeline-element 122104, 122106, 122108, 122110, 122112, displaying the changed scene 122202 of the location at a subsequent time in contrast to the original scene 122002, as shown in FIG. 2v. When switching 122006, 122008 between location-based scenes 122104, the timeline 122102 changes accordingly. The viewer 120118 can hide 122114 the timeline 122102 to gain a better view of the scene 122002, 122202.

FIG. 2w illustrates an example, additional or alternative implementation design to the user interface presented in FIG. 2i to FIG. 2s for the visualization of the timeline system 120104 as an editor for creating and editing VR tours with 360° images and videos. The time-based scenes 122304, 122306, 122308 are represented as a dropdown menu attached to the corresponding location-based scene 122302 in a vertically structured scene overview 122318, but could be any kind of user selection setup. In the example, the user 120110 can add 122310 time-based scenes at the appropriate location-based scenes 122302, 122312, 122314. Further location-based scenes can be added 122316 to the overview and reordered by drag and drop or other interaction, for example. All modals 121202, 121502, 121902 presented in FIG. 2i to FIG. 2s may also be applicable to this user interface.

FIG. 2x and FIG. 2y illustrate an additional or alternative UI for the timeline system 120104 for a VR player for viewing 360° images and videos, compared to the implementation presented in FIG. 2t. The timeline system 120104 is visualized in the example by a dropdown menu 122404 listing all existing time-based scenes with date and time as well as name of the scenes, but could be any kind of user interaction selection setup. The viewer 120118 can either navigate 122404 to the preceding or succeeding scene or jump directly to a selected scene from the dropdown list 122404 in the example. With a second dropdown list 122402 the viewer 120118 can navigate between location-based scenes, just as in FIG. 2t. FIG. 2y shows the expanded dropdown menus 122402 122404 displaying the contained scenes. Any combination of lists or other displays could be used; the inclusion of two dropdown menus is merely exemplary.

FIG. 2i to FIG. 2y illustrate example user interface embodiments of the methods and systems described here. In particular, it may be possible to provide various implementations depending on the use case, e.g., a simple, practical UI for construction, a more artistic one for interior design, and/or one tailored to the user devices. While the interfaces described above were designed for desktop computers and browsers, the methods and systems here can be transferred into the context of devices with different input methods such as head-mounted displays that use hands-free interaction (e.g., pointing a crosshair by moving your head) or mobile touch devices such as smartphones, laptops and tablets with touch screen interfaces.

Holo Structure Examples

It is to be understood that the disclosed subject matter is not limited in its application to the details of construction and to the arrangements of the components set forth in the descriptions or illustrations herein. The disclosed subject matter is capable of other embodiments and of being practiced and carried out in various ways, alone or in combination with any of the other embodiments. Also, it is to be understood that the phraseology and terminology employed herein are for the purpose of description and should not be regarded as limiting.

FIG. 3a illustrates an example additional or alternative overall process of adding a ground-based map to work with existing or newly created Holos in order to provide context for the Holos when viewed. For example, the ground-based map could be imported from an amusement park which provides a map of its physical space, and the user then imports Holos for the various points within the park for the end user viewer to experience. Other example use cases may be for museums, using a map of the physical museum layout with imported Holos for viewing. Another non-limiting example is that of a Floor Plan of a house or other building which may be under construction. By importing a map or building a map in the system of the building, floor by floor, Holos may be created or imported to show the various rooms. The embodiments described below use the term Floor Plan, which is not intended to be limiting.

The process adds a Floor Plan to an existing or newly created Holo using a technical device 400106. The described process is couched as adding an additional Floor Plan to an existing Holo but is not limited to this usage scenario. The example starts with a newly created or existing Holo in 130102. In 130104 the system offers an interface for importing one or more Floor Plans from various file formats such as but not limited to documents, images or third party content, as described in detail in FIG. 3b. The imported Floor Plans may be interconnected to one or more Scenes of the Holo in 130106, as described in detail in FIG. 3c. Finally, in 130108, the created Floor Plans and Hotspots are added to the data structure of the Holo and the Holo is saved to the Server.

FIG. 3b illustrates an example additional or alternative process of importing a Floor Plan from various sources and formats using a technical device 400106. This process can be done numerous times to import multiple Floor Plans. Depending on the file format of the externally created Floor Plan, a specific import process is selected in 130202:

For image file formats 130226 like JPEG, PNG or GIF, the system may automatically apply adjustments and customizations using various image filters controlled by the user in 130204. The customized image is then used to extract a tailored Floor Plan Image 130206 which is then uploaded to the system in 130208.

Document file formats 130224 like PDF, DOC etc. may be previewed, adjusted and customized under the user's control using a specific Document Renderer in 130218. The tailored Floor Plan Image extraction from the document in 130220 is described in detail in FIG. 3d. The extracted Floor Plan Image is uploaded to the system in 130208.

Third Party Content 130222, for example from online maps, Computer Aided Design systems, document and/or cartography services or other sources, PDFs, or other file types, can be used as Floor Plans either by extracting content or linking to it via deep linking 130216. Adjustments before the extraction or linkage in 130214 may be done either on the platform of the Third Party Content or by a guided assistant in the system using a plugin for the Third Party Content. Finally, in 130212, the information for the uploaded Floor Plan Image or Third Party Content may be added to the data structure of the Holo.

FIG. 3c illustrates an example additional or alternative process of interconnecting a selected Scene with a location on an imported Floor Plan using a technical device 400106. First, in the example, the Scene may be selected 130302 through navigating to the desired Scene. Similarly, the desired Floor Plan may be selected in 130304, displaying it enlarged in the system's user interface. Using the enlarged Floor Plan, a location to which the selected Scene should be interconnected can be pointed out on the Floor Plan 130306. The interconnection between the Scene and a location on the selected Floor Plan may form a Hotspot on the Floor Plan. These Hotspots may be represented in the user interface by overlaying icons on the Floor Plan as illustrated in FIG. 3e, FIG. 3f and FIG. 3g. In 130308 the user can additionally add an orientation to the created Hotspot as shown in FIG. 3h.

FIG. 3d illustrates an example process of extracting a high definition version of a Floor Plan from a document after applying transformations like cropping or rotation using a technical device 400106. In order to display a document including one or more Floor Plan Graphics, in 130402 the system may render preview images of each page in the selected document using an appropriate document renderer. In the example, the user may select a preview image and apply Transformations like cropping or rotation in 130404, using an interface as illustrated in FIG. 3f, in order to extract a Floor Plan or a certain portion of it from the selected document page. Transformations applied by the user may be recorded by the system, and the system may continually and/or on a schedule update the preview according to the applied transformations. Finally, in the example, when the user reaches the desired extract of a Floor Plan, the system may use the recorded Transformations and render a high definition version of the Floor Plan in 130406, using the appropriate document renderer, this time in a high definition mode, in order to produce a detailed high definition Floor Plan Image.
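One way to realize this record-and-replay of transformations is sketched below: crop and rotation steps recorded on the low-resolution preview are replayed on a high-resolution render of the same page, with crop coordinates scaled up by the resolution ratio. The Transformation type and the replay function are hypothetical names used only for this sketch.

// Minimal sketch of replaying recorded preview transformations on a high-resolution page render.
type Transformation =
  | { kind: "rotate"; degrees: number }
  | { kind: "crop"; x: number; y: number; width: number; height: number }; // recorded in preview pixels

function replay(highRes: HTMLCanvasElement, recorded: Transformation[], scale: number): HTMLCanvasElement {
  let result = highRes;
  for (const t of recorded) {
    const out = document.createElement("canvas");
    const ctx = out.getContext("2d")!;
    if (t.kind === "rotate") {
      const rad = (t.degrees * Math.PI) / 180;
      out.width = Math.abs(result.width * Math.cos(rad)) + Math.abs(result.height * Math.sin(rad));
      out.height = Math.abs(result.width * Math.sin(rad)) + Math.abs(result.height * Math.cos(rad));
      ctx.translate(out.width / 2, out.height / 2);
      ctx.rotate(rad);
      ctx.drawImage(result, -result.width / 2, -result.height / 2);
    } else {
      // Crop coordinates were recorded on the preview, so scale them up to the high-resolution page.
      out.width = t.width * scale;
      out.height = t.height * scale;
      ctx.drawImage(result, t.x * scale, t.y * scale, out.width, out.height, 0, 0, out.width, out.height);
    }
    result = out;
  }
  return result; // detailed high definition Floor Plan Image
}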

FIG. 3e illustrates an example additional or alternative user interface of the Editor editing a Holo including Floor Plans using a technical device 400106. In the example, a Scene List 130508 in the UI lists the currently active Scene 130502 and further Scenes 130504 in the current Holo. More Scenes can be added using an interface opened by the "Add" button 130506 or other interaction. The central section 130510 may display the content of the currently active Scene 130502. In such an example, it may be overlaid by the Floorplan Interface 130526 in the top left, or other location. This interface 130526 may comprise a "stack" of Floor Plans 130512 130514, with the active one 130512 on the top and further Floor Plans 130514 of the Holo below. On the active Floor Plan 130512, multiple Hotspots 130522 and 130524 may be overlaid. In the example, the Hotspot 130522 may be interconnected to the active Scene 130502 and overlaid as active Hotspot 130522 on the active Floor Plan 130512. Further Hotspots 130524 on the active Floor Plan 130512 may be overlaid on the active Floor Plan 130512 using a different icon or color. Beside the active Floor Plan 130512, a set of tools is shown for adding another Floor Plan 130516 to the Holo, replacing the active Floor Plan 130518 and removing the active Floor Plan 130520.

FIG. 3f illustrates an exemplary user interface of the Editor for the import of a Floor Plan using a technical device 400106. As described above and in FIG. 3b, if the user imports a Floor Plan, a corresponding interface is presented based on the file format. In FIG. 3f an exemplary user interface for image files or documents is illustrated. The dialog 130602 in the example is made of a preview area 130608 with an attached Toolbar 130610, a page selection control 130612 and buttons to cancel 130604 the process or add 130604 the transformed Floor Plan. As described in FIG. 3b and FIG. 3d, the preview area 130608 together with the Toolbar 130610 can be used by the user to apply transformations like cropping or rotation to the previewed image. The buttons and icons in the Toolbar 130610 can vary depending on the possible transformations. Considering the file format of the Floor Plan source, the transformed image may be used directly or recreated in a high definition version using the resulting transformation steps as described in FIG. 3b and FIG. 3d. In case of a document source, a page selection control 130612 is displayed below the preview area.

FIG. 3g illustrates an exemplary user interface for Hotspot 130704 130706 navigation and creation on an enlarged Floor Plan 130702 using a technical device 400106. An active Floor Plan 130512 as seen in FIG. 3e can be enlarged by the user. The enlarged Floor Plan 130702 may be overlaid (similar to the smaller Floor Plan representation 130512 in FIG. 3e) by the interconnected Hotspots 130704 130706 with different colors and/or icons. If the active Scene 130502 is interconnected to a Hotspot 130704, this Hotspot 130704 may be highlighted compared to Hotspots 130706 interconnected to non-active Scenes 130504. The user may create or rearrange a Hotspot 130704 for the active Scene 130502 by selecting the desired location on the enlarged Floor Plan 130702. Rearranging an existing Hotspot can be done by the user with drag-and-drop or other interaction with the UI. A selection of an existing Hotspot may navigate the user to the interconnected Scene. The colors, icons and positions of the overlaid Hotspots may be updated accordingly.
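A minimal sketch of turning such a selection into a Hotspot is given below; storing the click position as coordinates normalized to the rendered plan size keeps the Hotspot in place however large the Floor Plan is displayed. The Hotspot fields and the createHotspot function are illustrative assumptions.

// Minimal sketch of creating a Hotspot from a click on the enlarged Floor Plan.
interface Hotspot {
  sceneId: string;       // the interconnected Scene
  x: number;             // 0..1, relative to the Floor Plan width
  y: number;             // 0..1, relative to the Floor Plan height
  orientation?: number;  // optional viewing direction in degrees (cf. FIG. 3h)
}

function createHotspot(ev: MouseEvent, planImg: HTMLImageElement, activeSceneId: string): Hotspot {
  const rect = planImg.getBoundingClientRect();
  return {
    sceneId: activeSceneId,
    x: (ev.clientX - rect.left) / rect.width,
    y: (ev.clientY - rect.top) / rect.height,
  };
}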

FIG. 3h illustrates an exemplary user interface for representation and addition of orientation 130802 to Hotspots 130704, 130706 on a Floor Plan 130702 using a technical device 400106. Further to the Hotspots 130704, 130706 shown in FIG. 3g, Hotspots 130704, 130706 may be extended with an orientation 130802 which represents the orientation of the interconnected Scene. Similar to simple Hotspots, the Hotspot with orientation 130704 interconnected to the active Scene differs from the ones 130706 interconnected to non-active Scenes. Icons of oriented Hotspots 130704, 130706 may be augmented with, for example, an arrow 130802 or other graphic indicating the orientation 130802 of the interconnected Scene within the Floor Plan 130702. The orientation of a Hotspot 130704, 130706 can be added or edited using different methods or user-aided flows. Additionally, turns performed in the Scene's VR interface 130510 may be synchronized with the orientation 130802 of the Hotspot's 130704, 130706 icon.

Orientation Examples

Additionally or alternatively, the systems and methods here may support various orientation features. FIG. 4a is an illustration of a scenario 140100 of an example of system orientation. In the example, the north direction 140104 of a first panoramic image 140102, which is created by a spherical or panorama camera FIG. 29a, is defined either manually by the user or in an automatic process, for example delivered by the camera device FIG. 29a. Furthermore, each panoramic image may be thought of as having its own virtual camera 140106 which represents the direction of the user facing the panoramic image. When multiple panoramic images are connected to each other and the rotation of the virtual camera 140106 on each panoramic image is not synchronized, even though they share a similar small part of the scene, problems may occur in their display. To overcome these problems, the virtual camera 140112 of the second connected panoramic image 140108 may be determined automatically by calculating the angle 140113 between the previous north direction 140104 and its corresponding virtual camera 140106. This angle 140113 is the same angle 140113 between the north direction in the next image 140110 and its virtual camera 140112. By using these two reference directions in the two images, without any user input, the orientation of the panoramic images may be synchronized.
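Expressed as headings in degrees, the synchronization reduces to applying the same north-relative offset in both images; the following small sketch (with the hypothetical function name syncCamera) illustrates the calculation.

// Minimal sketch of synchronizing the virtual camera of a second panoramic image using the
// two north directions as reference (all headings in degrees, measured in the same sense).
function syncCamera(north1: number, camera1: number, north2: number): number {
  const angle = (camera1 - north1 + 360) % 360; // angle 140113 between north and the first camera
  return (north2 + angle) % 360;                // apply the same offset relative to the second north
}

// Example: north1 = 90, camera1 = 120, north2 = 45  ->  second camera heading 75.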

FIG. 5a illustrates an exemplary way 150100 to apply a position to a photo in a scene on the system, relative to an imported floor plan. The system in the example comprises a camera 150102, 390102, including but not limited to a fully 360° spherical camera, and a computing device 150108, 400106 with a display 400110 that supports user input and is able to run software, including but not limited to smartphones, tablets and smartwatches. A digital document 150110 that provides a form of orientation to the user may be used, including but not limited to maps and/or floorplans. This document may be presented by the display to the user through the device 150108, 400106. Through software running on the device 150108, 400106, the user may be enabled to interact with this document. The device 150108, 400106 in the example does not have to be connected to the camera 150102, 400108, neither via cable nor wirelessly. When the user takes a photo 150106 with the camera 150102, 400108, he can also interact 150112 with the document 150110 on the device 150108, 400106, for example by pressing on the screen. With this interaction 150112, the user can choose the appropriate location on the example document 150110 where the image was taken with the camera 150102, 400108.

FIG. 5b illustrates 150200 how added locations 150202 could be visualized on a device 150108, 400106. The example method is able to support multiple images and multiple locations 150202. After one or multiple photos have been taken and one or multiple locations 150202 have been chosen, the one or multiple locations 150202 could be visualized on a device 150108, 400106 on a map 150204 or other display such as a floor plan, at the place where the user has chosen the locations 150202.

FIG. 5c is an example flowchart that illustrates a use case where a user 150302 regularly revisits locations in the real world which are depicted in the floor plan, to take photos of these locations at various points in time, for example weekly photos of every room in a building where the walking tour could be the same every week. When a user 150302 takes a photo 150304 with a camera 400108, in the example, the photo can be stored together with a timestamp 150306. The user 150302 can additionally or alternatively store a corresponding location on another device 400106. The user 150302 can either choose a new location 150308, or accept a waypoint 150312 that was previously created. If the user chooses a new location 150308, a new waypoint 150310 may be created which can be used in following visits to this location. After the user 150302 has chosen a location, either a new location 150308 or by accepting a waypoint 150312, both the position and a timestamp can be stored 150314 by the system. This process allows the system to automatically assign one location to different photos at different points in time. When the user 150302 wants to visit similar locations in a similar order as before, the system can even predict where the user took the photo by accepting a waypoint 150312 automatically, except for the first time a location is added 150308. The user 150302 can skip certain locations, for example if a location is not reachable at a given time. After the user 150302 finishes the tour of the physical location, photos and location information can be processed 150316 on a computing device 400106, for example adding annotations, sorting or uploading to a server. Optionally this can be done during the tour, but doing it afterwards does not require a connection between the photo-taking device and the one where the user 150302 chooses the location.

FIG. 5d illustrates an example of the system which allows annotation of photos. After a photo 150402 is taken, including but not limited to full 360° spherical photos, a user can add annotations and other arbitrary elements to it using a computing device 400106, including but not limited to images 150406, textual annotations 150408, 3D elements 150412 and drawings 150410. It is also possible to add audio annotations 150414, either by using an existing audio file or by recording audio with a microphone. The resulting scene 150404 can then be stored in addition to the original photo 150402.

Layout Examples

Additionally or alternatively, the systems and methods here may be used to structure different layouts. FIG. 6a illustrates an example visualization of a creation-mode in the system, which may be referred to as a canvas, and example features, including but not limited to measurement tools used to measure distances and angles. Measurement tools can be used to measure geometric dimensions in the canvas, including but not limited to distances and angles within specified canvases. This can be done from devices with a touch screen, head-mounted devices as in FIG. 30c-f, desktop computing devices as in FIG. 30a equipped with a keyboard and a mouse, or any other pointing or suitable input device.

Measurements can be made on 360° pictures shot with devices including but not limited to spherical cameras, as in FIG. 29a. As shown in FIG. 6a, in the reference-canvas-creation-mode 160134 a reference canvas may be created 160128 which may be used later as the reference to create shapes on it, measure distances on the canvas 160128, project 160142 the sphere texture 160150 onto the canvas 160128 and modify the canvas 160128 shape until it fits the physical structure (for example a wall) it represents in the background footage, including but not limited to 360° images and video frames. In an example, to create a canvas, the user can start at the floor 160130 of the scene in the UI to mark the complete floor area 160132, thereby indicating where the walls 160128 start. Separate reference canvases 160128 may be created by marking multiple positions on the different borders 160112, 160104, 160106, 160108 of the canvas 160128. One possible user flow example is to first choose a position 160102 on the lower border 160112 of the canvas 160128 and then define the height 160114 of the canvas by clicking on a second position 160106 on the upper bound 160116 of the canvas 160128 or by clicking on a second point 160104 on the lower bound 160112 of the canvas 160128. In the first case the third point 160108 has to be marked on the upper bound 160116 of the canvas 160128. By following these steps, the upper 160116 and lower 160112 bounds of the planar reference canvas 160128 may be defined. Optionally more borders, e.g. the left border 160124 and right border 160126, can be marked and then defined to create a rectangular reference canvas 160128.

The correct 3D position and rotation of the reference canvas 160128 can be calculated using information including but not limited to the height of the camera the content was captured with relative to the floor plane 160130, the angles between the floor plane 160130 and the camera, and the angles between the floor plane 160130 and the physical walls 160132. Using this information the 3D position of the intersections 160102 160104 of the floor plane 160130 and the surface 160128 can be calculated. Using the same information, the 3D position of the points 160106 160108 and the top border 160116 of the surface 160128 can then be calculated. This approach can be used on, including but not limited to, triangular and rectangular reference canvases 160128. If the reference canvas 160128 is perpendicular to the floor plane 160130, the angle between the floor plane 160130 and the physical walls 160132 can be derived automatically. In the same manner, all other relations between physical surfaces that are perpendicular to each other can be used to simplify the creation process 160134 for the user. The created reference canvases 160128 are all placed in the same virtual space and have correct absolute sizes, 3D positions and rotations. One or multiple reference canvases 160128 can be used to measure absolute values including but not limited to distances, angles and volumes. These measurements can be made between canvases because all canvases 160128 share absolute position, rotation and scale in the same virtual space, representing the same physical space with the same absolute properties. This allows the system to create arbitrary virtual representations of physical spaces by creating a virtual reference canvas 160128 for each surface 160130, 160132, 160150, 160118, 160114 in the physical space. The scale factor that maps distances and areas in the virtual space to the physical space can be determined by providing a distance or an area inside the image with the correct value and unit of the physical world. This includes but is not limited to the height of the camera.
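As a hedged illustration of the geometry described above, the sketch below computes the 3D position of a point where a wall meets the floor, and the wall height, from the camera height and the viewing angles below and above the horizon; the function names and the simplified assumptions (a level camera and a vertical wall) are illustrative only.

    import math

    def floor_point_from_angles(camera_height, yaw_deg, pitch_down_deg):
        """3D position (x, y, z) of a point on the floor plane seen from a camera at
        (0, camera_height, 0), where pitch_down_deg is the angle below the horizon."""
        distance = camera_height / math.tan(math.radians(pitch_down_deg))
        x = distance * math.sin(math.radians(yaw_deg))
        z = distance * math.cos(math.radians(yaw_deg))
        return (x, 0.0, z)

    def wall_top_height(camera_height, pitch_down_deg, pitch_up_deg):
        """Height of a vertical wall whose base is seen pitch_down_deg below the
        horizon and whose top edge is seen pitch_up_deg above it."""
        distance = camera_height / math.tan(math.radians(pitch_down_deg))
        return camera_height + distance * math.tan(math.radians(pitch_up_deg))

    # Example: camera 1.6 m above the floor, wall base 20 deg below the horizon,
    # wall top 30 deg above it.
    print(floor_point_from_angles(1.6, 0.0, 20.0))
    print(wall_top_height(1.6, 20.0, 30.0))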

Additionally or alternatively, this reference canvas 160128 can then be used in the other modes. In some examples, multiple reference canvases 160132 can be created and connected to reflect the complete physical structure of the scene. Reference canvas one 160128 can be used to simplify the creation process of canvas two 160132 if both physical structures represented by the canvases are orthogonal to each other. In this case two points, e.g. the height and right start point 160124 of canvas two 160132, may already be defined by canvas one 160128 and only, e.g., the length of canvas two 160132 has to be defined to create the second canvas.

Additionally or alternatively, the user can define the floor canvas 160130 before defining all walls simultaneously, by first marking the exact shape of the floor 160130 and, as a second step, defining the height of all walls in the same way the height 160114 is defined when each reference canvas 160128 is created individually.

In some examples, after at least one canvas 160128 has been defined, the measurement-mode 160136 can be used to measure distances 160110 on the canvas 160128. A start point 160110 and an end point 160110 may be marked by the user to start a measurement of a one-dimensional distance 160122. In the same manner, by marking the start and end point of the measurement, two-dimensional areas 160118 and three-dimensional volumes 160144 can be measured on the reference canvas 160128 as well.

Additionally or alternatively, any custom text 160122 and other annotations, drawings, colors, or other content can be placed on the canvas 160128 just as they are normally placed without a reference point in 3D space or on the sphere. Using the orientation and other properties of the canvas 160128, the added content 160122 can be aligned with this canvas 160128 accordingly.

Additionally or alternatively, in a similar way to the measurement-mode 160136, the user can switch to the angle-mode 160138 and measure angles 160146 on the created canvases 160132. To measure an angle, the user may have to define points 160148 on the canvas 160132 marking the angle. Then a 3D UI 160146 may be generated to render the defined value in the 3D space as part of the virtual overlay on top of the canvas.

The projection-mode 160142 may enable the projection of a virtual scene, including but not limited to a 360° image of the scene 160150, onto the created canvas 160128. The projection may be done by aligning the edges of the canvases with the corresponding edges on the 360° image and stretching or compressing the rest of the 360° image such that these edges remain aligned. This can be done with all physical structures of the scene, which results in a 3D reconstruction of the scene where the underlying original context image 160150 is fully overlaid by the created canvases 160128. In the example, the system allows a conversion of a single 360° image 160150 into a 3D model which can be rendered with correct depth in the stereo mode.

Alternatively or additionally, in the shape creation mode 160140 a shape 160120 which corresponds, for example, to a physical object (in this example a door) 160118 can be defined on the canvas 160132, which then can be used as, including but not limited to, a hitbox area or a 3D object to allow user interactions with this created shape 160120. This way the user can define what should happen when the created shape 160120 is selected, e.g. clicked or tapped. One of multiple possible examples is that as soon as the shape 160120 is clicked the scene switches to a new one using a command system. Another example would be that the user extracts the shape from the canvas to use it as a flat 3D model in the scene, by using the projection-mode 160142. The user could for example select a window 160144 on the canvas 160128, duplicate it and place it next to the original window to modify the scene.

Web Page Examples

Additionally or alternatively, the systems and methods here may support web page features. FIG. 7a illustrates an example of how the creator of a Holo can create an HTML page which is overlaid over the virtual scene as a special type of overlay for any scene 122002. This overlay 170101 may become visible on top of the virtual scene when the users open the scene later in the player mode. This way any HTML page 170101, with all features supported by HTML and all types of rich content 170102 including but not limited to text, images, videos and/or other 3D renderings using WebGL, can be placed on top of a virtual scene 122002.

In such examples, the overlaid HTML page 170101 can communicate through a JavaScript API with the underlying Holo to control it, receive information and execute tasks, including but not limited to switching the current scene 122002 or rotating the virtual camera when a specific method in the overlaid HTML page 170101 is called.

This technique allows for customization. A few non-limiting examples would be creating custom 2D geographical maps as overlays over the virtual scene 122002 showing up after a video 170102 inside the HTML page 170101 has ended, or the content of the HTML page 170101 changing after a user with a head-mounted display such as the Cardboard 400202 turns towards a certain direction.

Annotation Examples

Additionally or alternatively, the systems and methods here may support annotation features. FIG. 8a shows an example UI representation displaying an annotation which alternatively or additionally may be used with the systems and methods described here. An annotation may be an entry in either a global or a local annotation list 180112. An annotation may be associated with one or more objects 180104 in a Holo 180102, which include, but are not limited to, 3D objects 180106 and text 180108. Moreover, an annotation may optionally be associated with one or more users 180110. An annotation can represent either a task to be done by the associated user(s) or an arbitrary type of note in either textual or visual form.
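The sketch below captures this description as a small data structure; all field names are illustrative assumptions rather than the system's actual schema.

    from dataclasses import dataclass, field
    from typing import List

    @dataclass
    class Annotation:
        """One entry in a global or local annotation list (field names are illustrative)."""
        description: str                      # textual or visual description shown in the list
        object_ids: List[str]                 # associated objects in the Holo (3D objects, text, ...)
        user_ids: List[str] = field(default_factory=list)  # optional associated users
        is_task: bool = False                 # a task to be done vs. an arbitrary note
        resolved: bool = False

    note = Annotation(description="Check the door frame", object_ids=["door_180118"],
                      user_ids=["alice@example.com"], is_task=True)
    print(note)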

Additionally or alternatively, FIG. 8b depicts an example diagram of the process flow for a creator of a Holo, or another user with sufficient access rights, who wants to create a new annotation by choosing the corresponding option 180202 in the Holo editor on any device including, but not limited to, devices described in FIG. 29d, FIG. 30a. Subsequently, in no particular order, the creator or user may specify the associated object(s) within the Holo 180208, specify the associated user(s) 180206 and describe the annotation either textually or visually as it should appear in the annotation list 180204. In the example, the specification of the associated object(s) and the description of the annotation may be required while the specification of the associated user(s) may be optional. Associated users can either be existing users of the system or external persons that can be identified using a global digital identifier such as an e-mail address. Once the necessary parameters are present, the annotation may be added to the global or local annotation list 180210, according to the creating user's choice. In case one or more associated user(s) have been specified, they are notified of the new annotation 180212, e.g., via e-mail, and granted access rights to the corresponding annotation list.

Additionally or alternatively, FIG. 8c shows a process flow diagram for creating a new annotation. An annotation can be created by the creator of a Holo, or another user with sufficient access rights, after having selected one or more objects within the Holo using any systems including, but not limited to, devices described in FIG. 29d, FIG. 30a. With the object(s) selected, the creator or user may choose the option for creating a new annotation 180302. Subsequently, they may describe the annotation either in textual or visual form as it should appear in the annotation list 180306 and optionally specify one or more associated users 180308. Once the necessary parameters are present, the annotation may be added to the global or local annotation list 180310, according to the creating user's choice. In examples where one or more associated user(s) have been specified, they are notified of the new annotation 180312, e.g., via e-mail, and granted access rights to the corresponding annotation list.

FIG. 8d illustrates an example process flow diagram after a new annotation has been created. The annotation process can additionally or alternatively be performed on a device including, but not limited to, devices described in FIG. 29d, FIG. 30a, and the annotation added to either a global or a local annotation list. All users associated with the annotation may be notified 180402. Subsequently, they can access the corresponding annotation list 180404. When accessing the individual annotation, they may automatically be forwarded to the Holo containing the associated objects, with the focus on these objects, as illustrated in FIG. 8e. With the given information they proceed however necessary 180406 and can afterwards mark the annotation as resolved 180408.

FIG. 8e depicts an exemplary view which can be seen using any device including, but not limited to, devices described in FIG. 29d, FIG. 30a, when accessing an individual annotation either from a corresponding annotation list 180508 or directly from a notification, or when selecting 180510 one or more objects in a Holo 180504 180506 that are associated with an annotation 180512. The annotation may be shown directly within the Holo in its either textual or visual form, spatially placed at perceptibly the same or a similar 3D position as the associated objects. In case the associated user was forwarded to the Holo from an annotation list or notification, the associated objects are automatically focused.

FIG. 8f illustrates an exemplary view when marking an annotation using a device including, but not limited to, devices described in FIG. 29d, FIG. 30a as discussed. In the example, directly within the Holo, the associated user can mark the annotation as resolved, e.g., by clicking a checkbox 180614 or similar. The new state of the annotation may then automatically be propagated to the corresponding annotation list 180608, in some examples in real time. This synchronization may be of a two-way nature. That is, indicating that an annotation is resolved within the corresponding annotation list affects the displayed state of the annotation within the Holo in real time as well. In case an annotation represents a task, it is globally marked as resolved as soon as one associated user has marked it as resolved. In case the annotation does not represent a task, it may be globally marked as resolved as soon as all associated users have marked it as resolved. An example of the latter case is an important notice that must be read by a specified list of persons.

FIG. 8g shows the scope of a global annotation list example comprising several Holos 180702. That is, additionally or alternatively, the list can contain annotations for all objects within Holos that lie within that scope. In contrast, the scope of a local annotation list may be limited to a single Holo 180704. The type and scope of an annotation list may be defined by the creator of the list. An annotation list can be created either on the fly after having created a new annotation 180210 180310 or using a dedicated, separate interface.

Painting Examples

Additionally or alternatively, the systems and methods here may support painting experiences. FIG. 9a depicts an example set of features that may be used, such as a painting tool for Holos and 360° images which may be created by a spherical or panorama camera including but not limited to devices described in FIG. 29a and FIG. 29b. The painting tools may provide the user with a way of creating arbitrary free-form strokes 190104 190106 directly within a Holo comprising a 360° image 190102, which also can be done using, including but not limited to, the device described in FIG. 29d. Any array of painting tools may be provided, such as those found in other painting programs, including multiple colors, shades, patterns and textures, various virtual paint brush sizes and shapes, as well as virtual pens, pencils, erasers, etc. The strokes may be painted directly onto the surface of the 360° image, as rendered onto the inside of the sphere in the three-dimensional scene, i.e., the Holo, by the user after selecting the free-form painting tool from a corresponding interface. The painting process may be carried out using human-computer interfaces such as, but not limited to, a computer mouse and screen 400110, handheld pointers, joysticks, or other controllers, or a touch screen, FIG. 29d. The free-form strokes can have arbitrary thickness and color, both of which can be determined using a corresponding interface. Particularly, in some examples, free-form strokes can be annotations in the sense of FIGS. 8a, 8b, 8c, 8d, 8e, 8f and 8g, and/or annotations in the sense of FIG. 10a.

FIG. 9b illustrates example painting tools for Holos and 360° images providing the user with a way of integrating predefined geometric figures 190204 190206 190208 into a Holo comprising a 360° image 190202. Predefined geometric figures include, but are not limited to, rectangles 190204, squares, diamonds 190206, ellipses, and circles 190208. The geometric figures may be painted directly onto the surface of the 360° image, as rendered onto the inside of the sphere in the three-dimensional scene, i.e., the Holo. They may be placed by the user after selecting the respective painting tool (e.g., rectangle, diamond, circle) from a corresponding interface. The process of placing the geometric figure and specifying its size may be carried out using human-computer interfaces such as, but not limited to, a computer mouse and screen 400110 or a touch screen, FIG. 29d. The geometric figures can have arbitrary border thickness and color, both of which can be determined using a corresponding interface. Particularly, the geometric figures can be annotations in the sense of FIGS. 8a, 8b, 8c, 8d, 8e, 8f and 8g, and/or annotations in the sense of FIG. 10a.

Multi-User Examples

Additionally or alternatively, the systems and methods here may support multi-user experiences, with objects in the Holo that have avatar features of one or more users. FIG. 10a illustrates an example view of a user in the player mode which is part of the multi-user experience, on a device including, but not limited to, devices described in FIG. 30a. In the example, a user can invite other users 200101 to a multi-user experience in any created Holo. As such, users 200101 who join this experience may see the same augmented and virtual content 122002 and additionally other users 200101 in the scene, on a device including but not limited to devices described in FIG. 30a. A webcam feed 200102 may be used in certain embodiments to see either a visual representation of the user, or another virtual representation such as a virtual avatar, which can be chosen by a user, or be a representative or live feed of their face or body. Additionally or alternatively, audio can be used in some examples for natural communication between all users 200101; the spatial position of the audio is the same as the position of the virtual representation of the other user 200101, so that the voice of this other user 200101 is heard from the correct direction and distance based on the user's own position and orientation. In normal 3D scenes where the camera 400102 can move freely and is not locked in the center of the scene as in 360° scenes, the other users' 200101 webcam feeds 200102 can be shown at the location where each user's virtual camera is located at a given time. The orientation of the other users 200101 may be projected onto their avatars to give an understanding of where the other users 200101 are looking. Users 200101 joining the multi-user experience may obtain certain rights, for example to add and draw annotations 200103 in the 360° image/video, which can be created by a spherical or panorama camera including but not limited to devices described in FIG. 29a, which may then be synchronized among all connected clients. Such annotations can in particular be annotations in the sense of FIGS. 8a, 8b, 8c, 8d, 8e, 8f and 8g. Additionally or alternatively, users may paste and place text and other rich content 200104 in the scene. The augmented or virtual scene 122002 may provide the context, and the result of the collaboration session is a Holo with the annotations, text, images, links and other creatable content that can be saved as a new branch of the original Holo with which the session started.

FIG. 10b shows an example in which a multi-user experience contains a 360° scene 200202 with a fixed spatial position 200204 of the virtual camera and only its orientation set by the users 200101. In such an example, the virtual representations of the users 200101 may be placed at the positions 200205 in the 360° scene where the other users 200101 are currently looking. The orientation of the avatars of the other users 200101 may face the center of the sphere 200204. The user 200204 may only see the users in his or her current field of view 200203, like any other virtual content in the 360° scene. If many users 200101 are looking at the same location, their avatars may all be at the same location 200205. To show only specific avatars and make the other avatars less present in the scene, the user can select the relevant avatars 200208, for example a single presenter 200206 who presents to a large audience, through a selection UI. This selection UI can be, for example, a separate list 200207, or, as another example, the user picks the target characters using a magnifier effect to separate them from the unwanted surrounding avatars. In some examples, by default, if there is a large audience but only a very small subset of this audience 200208 has edit rights to the Holo, the users in this subset can be selected 200208 as the default selection of highlighted users. In some examples, users with edit rights 200208 to the Holo they are in can change the scene permanently and all other users 200101 may see these changes automatically. Some examples may allow users to exchange textual information such as links and other textual content in an optional chat box 200209. In such examples, users with the authority to post information can exchange this information in the system or by using a third-party system.

In multi-user examples, users can join the multi-user experience in any number of ways including but not limited to receiving and opening a link sent by the creator of the session. Logins, links, or other mechanisms may be used as well. In such examples, the session can either be protected by a passcode to allow private sessions with only limited access for users who know the passcode, or be public, where it is accessible by any user, to create digital open spaces for information exchange. This way there may be multiple sessions in the same virtual space without interfering with each other.

Single users can have the management rights to author the created multi-user experience session and can have the power to, for example, mute, hide or ban other users from the session.

Audio Examples

Additionally or alternatively, the systems and methods here may support audio features and/or experiences, including using various audio channels in user displays, headsets and computing devices. FIG. 11a is an illustration that visualizes example audio sources 210104 with a 3D position in a virtual scene 210102. The virtual scene 210102 can both be viewed and created with a technical device 400106. This method can embed audio sources 210104 in virtual scenes 210102. Example audio sources 210104 can have a 3D position in a scene 210102. In examples where a sound is intended to be perceived the same from every position and orientation in the scene, it does not need to have a 3D position in this scene. Adding audio in a three-dimensional space allows users to hear sound from certain positions. Such sound may depend both on the position and orientation relative to the audio source. The relative position to the audio source may determine the total volume level that can be heard, while the orientation may determine the amount of volume that is played on each of two audio channels separately, which is less than or equal to the total volume level. This method also supports audio sources without considering the distance to the audio source, e.g. by using the same distance for all audio sources. Multiple audio sources 210104 may be supported at the same time, making it possible for a user to hear different sounds from different directions with different volume levels. An audio source can either be played all the time, or be triggered by certain events. Such methods can also be used to annotate different elements in a scene with audio annotations which can be triggered when the viewing user later clicks the virtual element the audio annotation is attached to.
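A minimal sketch of this behavior follows, assuming the listener and sources lie on a single floor plane and using a simple distance-attenuation plus constant-power panning model; the function name and parameter choices are illustrative and not the described implementation.

    import math

    def spatial_audio_gains(listener_pos, listener_yaw_deg, source_pos, ref_distance=1.0):
        """Return (left_gain, right_gain) for an audio source; positions are (x, z)
        tuples on the floor plane. Distance sets the total volume, orientation splits
        it between the two channels, so each channel is at most the total volume."""
        dx = source_pos[0] - listener_pos[0]
        dz = source_pos[1] - listener_pos[1]
        distance = max(math.hypot(dx, dz), 1e-6)
        total = min(1.0, ref_distance / distance)           # total volume from distance
        # Angle of the source relative to the listener's facing direction.
        angle = math.degrees(math.atan2(dx, dz)) - listener_yaw_deg
        pan = math.sin(math.radians(angle))                  # -1 = fully left, +1 = fully right
        left = total * math.sqrt(0.5 * (1.0 - pan))
        right = total * math.sqrt(0.5 * (1.0 + pan))
        return left, right

    # Example: a source one metre ahead and one metre to the right of the listener.
    print(spatial_audio_gains((0.0, 0.0), 0.0, (1.0, 1.0)))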

FIG. 11b is a flow chart that visualizes an exemplary import process for audio sources. A user 210202 uses a computing device 400106 and either starts by choosing the scenes or by choosing the audio source first.

Example Choice 1: A user 210202 can start by choosing the scenes 210204 in which the audio source will later be embedded. After choosing one or more scenes 210204, the user 210202 may choose a position 210206 for each scene that was chosen in the previous step. For reasons of simplicity, the same position can be used for multiple scenes. In such examples, after a position is chosen for every scene, the actual audio source can be chosen. The audio source can either be an existing file on the user's 210202 device 400106 or can be recorded 210208 using a microphone. After this step, the audio source is successfully embedded in the scenes 210212.

Example Choice 2: A user 210202 can start by choosing the audio source first. The audio source can either be an existing file on the user's 210202 device 400106 or can be recorded 210208 using a microphone. After choosing the audio source, the next step is to choose the scenes 210204 in which the audio source will later be embedded. After choosing one or more scenes 210204, the user 210202 has to choose a position 210206 for each scene that was chosen in the previous step. For reasons of simplicity, the same position can be used for multiple scenes. After this step, the audio source is successfully embedded in the scenes 210212.

Embodiments may allow users to utilize both choice 1 and choice 2 in a Holo or scene as described herein. Describing the two methods is not intended to be limiting in any way.

360° Live Streaming Examples

Additionally or alternatively, the systems and methods here may support 360° live streaming. FIG. 12a shows an example of how the creator of a Holo can use a 360° live stream 220101 instead of a single 360° photo or video as the context for the virtual scene. Using a 360° webcam FIG. 29a, including but not limited to the Ricoh Theta, makes it possible to stream a 360° live feed 220101 as would be possible with any normal webcam 400108. This enables the user to apply this 360° live stream 220101 as the context for a virtual scene. Additionally, the 360° live stream 220101 can contain all the other 3D content as any other single 360° photo or 360° video Holo scene.

One possible implementation example is one user streaming live from a remote location while other users 200101 use the live collaboration feature to talk to this user and annotate content 200103 in the live stream 220101 as they would in any other Holo scene. Annotated content can be the same content as for any other multi-user experience, such as text, temporal drawings 200103 or other rich content 200104. This way they can annotate the content temporarily visible in the scene and the user streaming the feed 220101 can see these annotations in his or her virtual scene as any other user 200101 can.

In such examples, if a user 200101 enters a scene where there is no live stream 220101 available, the creator of the Holo can specify a fallback action such as displaying a placeholder 360° image or 360° video instead of the unavailable live stream 220101. This functionality can also be used for finite video streams 220101 to play the recorded live stream after the streaming has finished. In such cases the streamed video may be automatically recorded while the streaming is happening and this recorded file automatically specified as the fallback 360° video content as soon as the finite stream 220101 has finished.

Zoom Examples

Additionally or alternatively, the systems and methods here may support zooming displays. FIG. 13a illustrates an example fully zoomable VR/AR environment of a panoramic or spherical image 230104 with an undistorted display of several pieces of visual 2D/3D content 230106 on a visual display system 230102. The panoramic image 230104, which may be created by a spherical or panorama camera FIG. 29a/FIG. 29b, is defined either manually by the user or in an automatic process, for example delivered by the camera device FIG. 29a-c. In such examples, the individual pieces of the 2D/3D content 230106 may overlap and occlude one another from the view of the user. The visual display system 230102 can be viewed through any of multiple display devices, for example but not limited to those in FIG. 30a-f.

Distortion Examples

Additionally or alternatively, the systems and methods here may support distortion of displays. FIG. 13b and FIG. 13c depict two exemplary fisheye distortion views 230208 of an undistorted VR/AR environment 230104: the circular fisheye view 230202 and the Cartesian fisheye view 230302. In a fully zoomable VR/AR environment 230104 of a system to display VR and AR environments 230102 holding an undefined number of pieces of 2D/3D content 230106, one or more objects 230204 of the inserted 2D/3D content 230106, as shown in FIG. 13a, might overlap or occlude one another. The inserted 2D/3D content 230106 might, e.g., be a live webcam stream of users working on a VR/AR environment 230104 simultaneously. With the presented example method of the fisheye distortion 230202 230302, the user can focus his or her view on a certain object of interest 230206. If the named object of interest 230206 is occluded by individual objects 230204 of the geometrically related 2D/3D content 230106, the user can use the fisheye selection process, e.g. 230202 230302, to separate close objects and choose the one he or she is interested in 230206. In the resulting fisheye view 230202 230302, the visualizations of the related 2D/3D content 230208 will up- or downscale dynamically with the cursor movement of the user. Clicking on one of these visualizations may invoke the action related to this object 230206.

FIG. 13d illustrates a flowchart showing example individual steps of an example selection process 230400 of the method for transitioning between views of visual 2D/3D content in VR/AR environments 230104, 230208. The example in FIG. 13d includes a number of process blocks 230402-230414 displaying the exemplary flow of the process 230400. Though arranged serially in the example of FIG. 13d, other examples may reorder the blocks, change one or more blocks, and/or execute two or more blocks in parallel using multiple processors or a single processor organized as two or more virtual machines or sub-processors.

At 230402, visual content is displayed in an existing Holo as an undistorted visualization of the 2D/3D content. Pieces of the visual content overlap and occlude the piece of interest. At 230404, the user triggers a request to focus on an individual, at least partially occluded, piece of the undistorted content visualization. At 230406, the undistorted content visualization is converted to a distorted projection focusing on the user's piece of interest. At 230408, the pieces in the distorted projection are decreased in size according to their geometric proximity to the piece of interest. The down-sized pieces highlight the piece of interest and invoke a fisheye view as described at 230410. While highlighting a certain piece of content, the user may interact with the content according to its individual interaction possibilities. At 230412, the user requests to change back to the undistorted content visualization from the distorted content projection. At 230414, after changing back to the undistorted visualization, the downsizing effect on the distorted content projections may be decreased. The content visualization in the example appears without distortion effect.
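The following is a minimal sketch of one possible scaling function for such a fisheye view, in which pieces close to the focused piece of interest are magnified and pieces farther away are downsized; the radius, magnification values and falloff curve are illustrative assumptions, not the described implementation.

    import math

    def fisheye_scale(distance_to_focus, radius=200.0, max_magnification=2.0):
        """Scale factor for a piece of content given its screen-space distance (in
        pixels) to the focused piece of interest: near pieces are enlarged, pieces
        beyond the radius shrink to a minimum size (a simplified fisheye)."""
        min_scale = 1.0 / max_magnification
        if distance_to_focus >= radius:
            return min_scale                         # far pieces are downsized
        # Smoothly interpolate from maximum magnification at the focus to the minimum at the radius.
        t = distance_to_focus / radius
        falloff = 0.5 * (1.0 + math.cos(math.pi * t))    # 1 at the focus, 0 at the radius
        return min_scale + falloff * (max_magnification - min_scale)

    for d in (0.0, 100.0, 250.0):
        print(d, round(fisheye_scale(d), 2))   # 2.0 at the focus, 1.25 halfway, 0.5 outside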

Loading Examples

Additionally or alternatively, the systems and methods here may support various loading examples. FIG. 14a illustrates a VR/AR environment 122002 during the loading process of the AR/VR environment in the player mode 122000 of an AR/VR editor. The panoramic image 122002, which is created by a spherical or panorama camera FIG. 29a/FIG. 29b, is defined either manually by the user or in an automatic process, for example delivered by the camera device FIG. 29a-c. The visual display system 230102 can be viewed through multiple display devices FIG. 30a-f.

While the content of the VR/AR environment 122002 is being loaded in the player mode 122000, a loading screen 240101 may be shown to the user for the duration of the loading process of the VR/AR environment 122002. In this example, the loading screen 240101 may display a quick instruction 240102, 240103 for the user on how to interact with the VR/AR environment 122002. The content 240102, 240103 of the loading screen 240101 may differ according to the usage scenario and is not limited to the exemplary implementation illustrated in FIG. 14a.

If the scene is loaded on a device such as a head-mounted display or virtual reality headset which supports rendering the loading screen as a 360° sphere around the user, the full sphere can be used to display the intermediate content, such as the instruction example 240102 and other information about the loaded scene or a placeholder graphic, video or other rich content, which is shown until the full scene 122002 is loaded and ready to be displayed.

Additionally or alternatively, the semi-transparent design can be configured as, e.g., animated instructions or as a loading progress bar displaying the animated loading screen. Furthermore, a program logic can be provided which only shows a loading screen 240101 while loading the VR/AR environment 122002 if the duration of the loading process actually exceeds a predetermined loading time. For example, the program can be designed so that the loading screen 240101 is only displayed when the loading process lasts longer than, for example, 250 milliseconds.
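A hedged sketch of such program logic is shown below: the loading screen is only shown if loading has not finished within the predetermined delay; the callback names are illustrative placeholders.

    import threading
    import time

    def load_scene_with_delayed_overlay(load_scene, show_loading_screen, hide_loading_screen,
                                        delay_ms=250):
        """Show the loading screen only if loading takes longer than delay_ms."""
        shown = threading.Event()
        timer = threading.Timer(delay_ms / 1000.0, lambda: (shown.set(), show_loading_screen()))
        timer.start()
        try:
            scene = load_scene()            # potentially slow: network transfer, decoding, ...
        finally:
            timer.cancel()                  # no overlay at all if loading finished quickly
            if shown.is_set():
                hide_loading_screen()
        return scene

    # Example usage with dummy callbacks (loading here takes 400 ms, so the screen appears):
    scene = load_scene_with_delayed_overlay(
        load_scene=lambda: (time.sleep(0.4), "scene")[1],
        show_loading_screen=lambda: print("show loading screen"),
        hide_loading_screen=lambda: print("hide loading screen"),
    )
    print(scene)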

Face Detection Examples

Additionally or alternatively, the systems and methods here may support face detection features. FIG. 15a shows an example system configured with automatic face detection in the VR scene. The automatic face detection feature set may work with many types of input 250110, delivered by a device including but not limited to devices described in FIG. 29a, FIG. 30a. Such examples may be a panoramic image 250112, a panoramic video 250111, or other virtual reality scenes 250114, including but not limited to a 3D mesh with a texture projection. The input 250110 may be analyzed automatically 250116 by the system. If faces are detected in the input 250110, they may be blurred 250120 automatically, if the system is programmed to do so. Furthermore, facial expressions may be analyzed 250118, so that the emotions present in the scene 250122 can be analyzed. The original input 250110 may not be stored in the database. In such examples the original input 250110 may not be accessed later on.

Stability Examples

Additionally or alternatively, the systems and methods here may support stabilizing features. FIG. 16a is a flowchart illustrating an example automatic dynamic rendering resolution adjustment to keep a stable frame rate using a technical device 400106. The system in this example automatically adjusts the rendering resolution 260101 according to the current frame rate 260102 of the rendering system 260103.

In some examples, systems with limited processing capability may not be able to display higher resolution images at the desired frame rate. The systems here may alter the rendering resolution 260101 by dropping to a defined minimum resolution 260104 until the frame rate 260102 reaches a defined range 260105 of acceptable frame rates. This process may be repeated and measured to react to temporary reductions in frame rate 260102 due to possible loading procedures, which can occur at any given time. Such a loading process of additional content or any other computationally expensive calculation may then reduce the rendering resolution 260101 temporarily until the process is finished and the resolution 260101 is increased again.

The system may automatically employ these methods for devices with hardware which is not as capable of rendering complex 3D scenes as a desktop computer 400122 might be. Slower devices, including but not limited to mobile devices 400102 or head-mounted displays 400300, may benefit from this system, which may cause the resolution to be reduced automatically until a constant acceptable frame rate is reached.
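A minimal sketch of such a frame-rate-driven adjustment follows; the target frame-rate range, scale bounds and step size are illustrative assumptions rather than the described system's parameters.

    def adjust_render_resolution(current_scale, fps, target_fps=(55.0, 60.0),
                                 min_scale=0.5, max_scale=1.0, step=0.05):
        """Return an updated rendering-resolution scale factor based on the measured
        frame rate: drop toward min_scale while the frame rate is below the acceptable
        range, and recover toward max_scale once it rises above it."""
        low, high = target_fps
        if fps < low:
            return max(min_scale, current_scale - step)
        if fps > high:
            return min(max_scale, current_scale + step)
        return current_scale  # frame rate is inside the acceptable range

    # Example: called periodically with the measured frame rate.
    scale = 1.0
    for measured_fps in (40, 45, 58, 62, 61):
        scale = adjust_render_resolution(scale, measured_fps)
        print(measured_fps, round(scale, 2))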

Fragmentation and Sphere Examples

Additionally or alternatively, the systems and methods here may support display fragmentation and/or segmentation of a spherical image. Segmentation may refer to analyzing an image and dividing or segmenting it into logical shapes that may be designated using any of various algorithms. The algorithms may then be used to identify certain segmented shapes which may be analyzed, found, counted, loaded in order, etc. For example, an algorithm may identify and segment all windows in a room. These windows may have a consistent shape or color and may be identified by the system through image analysis. Then, over many Holos, the system could count the number of windows and identify where they are located on a Floor Plan.

The higher the resolution of the image, the more accurate the segmentation may be. This can allow selective compression of parts of the image, and designated shapes may be loaded more quickly, or first, before the other aspects of the image. Processes may be sped up in this manner. Other examples include finding shapes in images using algorithms. The higher the resolution, the more accurate the segmentation, and the more accurately the shapes may later be found.

Fragmentation may refer to breaking or fragmenting an image into a distinct pattern, such as a grid, to be split up for faster loading. Instead of identifying specific shapes in an image, fragmentation may merely apply a repeatable pattern to an image to break it into chunks smaller than the entire image, which may be loaded in turn. This may save on computing resources, as blocks or fragmented portions that are more desirable to load first are displayed before less interesting fragments.

An example of fragmentation is shown in FIG. 17a, which shows an example tiled 360° image comprising several distinct parts 270104 in original resolution that have been created by segmentation of the original 360° image 270102, which was taken with a spherical camera device 270101 described in detail in FIG. 29a. The number of tiles may be changed in the context of the systems and methods here. In certain examples, the number of tiles chosen may be based on an algorithm taking into account the dimensions and resolution of the original image.

In some embodiments, the number of supported devices may be increased when the following two requirements are met by the algorithm: the number of tiles is a power of two and the tiles have a resolution of at most 2048×2048.
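The sketch below shows one way such a tiling could be chosen so that both requirements hold; the grid strategy and function name are illustrative assumptions rather than the specific algorithm referenced above.

    import math

    def compute_tile_grid(width, height, max_tile=2048):
        """Choose a tile grid for an equirectangular image so that each tile is at most
        max_tile x max_tile pixels and the total number of tiles is a power of two."""
        cols = 2 ** math.ceil(math.log2(max(1, math.ceil(width / max_tile))))
        rows = 2 ** math.ceil(math.log2(max(1, math.ceil(height / max_tile))))
        # cols and rows are each powers of two, so their product is a power of two.
        return cols, rows, cols * rows

    # Example: an 8192 x 4096 spherical image yields a 4 x 2 grid of 2048 x 2048 tiles.
    print(compute_tile_grid(8192, 4096))  # (4, 2, 8)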

FIG. 17b describes an example structure of an example sphere. Upon displaying a Holo comprising a 360° image, the systems and methods here may overlay a low-resolution version of the 360° image 270202, which was taken with a spherical camera device 270101 described in detail in FIG. 29a. This image 270202 may be loaded first, together with any of various high-resolution tiled versions of the same image 270204 or portions of the same image. The individual tiles 270206 may be loaded iteratively and asynchronously from the server. As soon as all tiles are present, the low-resolution version of the 360° image may be completely covered and thus removed.

FIG. 17c describes an example of how, on the client side 270302 using a computing device 400106, a user can upload the original version of a 360° image 270304. In a preprocessing component, a low-resolution single-tile version 270306 as well as a high-resolution multi-tile version 270308 may be computed from the original image. Over a communication network, e.g., the Internet, the low-resolution version as well as the individual tiles may be transmitted to a server 270310. That server may be responsible for storing all transmitted data in a persistent data store 270312. It should be noted that all preprocessing steps, particularly including the computation of the low-resolution single-tile version and the high-resolution multi-tile version of the original 360° image, may happen on the client 270302, while the server 270310 may solely be responsible for communication with the persistent data store as well as receiving and delivering the different versions of the 360° image.

FIG. 17d, which partly corresponds to FIG. 17c, describes how systems and methods here may first receive the original version of a 360° image from the user 270402, taken with a spherical camera device 270101 described in detail in FIG. 29a. Subsequently, in a parallel process, a low-resolution single-tile version 270404 as well as a high-resolution multi-tile version of the 360° image 270406 may be computed. Subsequently, the low-resolution version as well as all of the computed tiles may be transmitted to a server 270408 and saved to a persistent data store 270410.

FIG. 17e describes an example of the sphere loading process. When a user requests a Holo comprising a 360° image from the server 270502, first the previously computed low-resolution version may be transmitted to the client's computing device 400106 and displayed to the user 270504. After the low-resolution version is displayed the user can start using the system. In some embodiments, only after the low-resolution version is fully loaded may the distinct tiles of the high-resolution version 270506 be transmitted to the client's computing device 400106 in an asynchronous manner 270508. As soon as all individual tiles of the high-resolution version have been successfully transmitted by the server 270510, the low-resolution version of the 360° image may be completely covered and can be removed.
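A small asynchronous sketch of this progressive loading flow is given below; the fetch and display callbacks are illustrative placeholders for the network and rendering layers, not the system's actual API.

    import asyncio

    async def load_sphere(fetch_low_res, fetch_tile, tile_ids, display, cover_tile, remove_low_res):
        """Show the low-resolution 360° image first, then fetch the high-resolution
        tiles asynchronously and cover the low-resolution version once all tiles arrived."""
        low_res = await fetch_low_res()
        display(low_res)                              # the user can start interacting here
        tiles = await asyncio.gather(*(fetch_tile(tid) for tid in tile_ids))
        for tile_id, tile in zip(tile_ids, tiles):
            cover_tile(tile_id, tile)                 # place each tile over the low-res sphere
        remove_low_res()                              # low-res version is now fully covered

    # Example usage with dummy coroutines/callbacks:
    async def main():
        await load_sphere(
            fetch_low_res=lambda: asyncio.sleep(0.01, result="low-res image"),
            fetch_tile=lambda tid: asyncio.sleep(0.01, result=f"tile-{tid}"),
            tile_ids=range(8),
            display=print,
            cover_tile=lambda tid, tile: print("covered", tid),
            remove_low_res=lambda: print("low-res removed"),
        )

    asyncio.run(main())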

Object Examples

Additionally or alternatively, the systems and methods here may support display objects. FIG. 18a shows an example functionality for adding an arbitrary object 280104, including, but not limited to, text, 2D graphics, 3D objects and annotations, located on the client side 280102. In such examples, the functionality for computing a hash value for an object to be added to a Holo 280106 may also take place on the client side, e.g., using an algorithm including but not limited to MD5. Communication with the server side 280108 may happen via a communication network, as described herein, e.g., the internet. In such examples, the server may be responsible for storing objects and hash values 280110 to, and retrieving them from, any of various data stores 280112, 400106. Objects may persist in that data store 280112 along with their hash values 280110, whereas each pair of hash value 280110 and associated object may be present in the data store 280112 once and only once.

FIG. 18b depicts an example process of uploading an object to be used in a Holo and testing for an existing hash value. The process starts with receiving the object from the user 280202 from an arbitrary device FIG. 30a. Subsequently, a hash value for that particular object may be computed 280204 on the client side using a computing device 400106. By communicating with the server and transmitting the computed value, it is checked whether the particular hash value is already present 280208 in the data store 280112 from FIG. 18a by performing a corresponding search 280206. In example cases where the hash value is already present in the data store 280112, the user's object may be directly added to the Holo 280212 without further communication with the server. In example cases where the hash value is not yet present in the data store 280112, the user's object may be transmitted to the server and persisted to the data store 280112 along with its associated hash value 280210 before the object is added to the user's Holo 280212.
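A minimal sketch of this deduplicated upload follows, assuming an MD5 digest as mentioned above and an illustrative server interface with has_hash() and store() methods; neither the class nor the method names are part of the described system.

    import hashlib

    def upload_object_deduplicated(obj_bytes, server):
        """Compute the object's hash on the client and only transmit the object itself
        if the server does not already store that hash."""
        digest = hashlib.md5(obj_bytes).hexdigest()   # MD5 as named in the text; any hash would work
        if not server.has_hash(digest):
            server.store(digest, obj_bytes)           # persist the object together with its hash
        return digest                                  # the Holo can reference the object by its hash

    class FakeServer:
        def __init__(self):
            self.store_by_hash = {}
        def has_hash(self, digest):
            return digest in self.store_by_hash
        def store(self, digest, obj_bytes):
            self.store_by_hash[digest] = obj_bytes

    server = FakeServer()
    print(upload_object_deduplicated(b"3d-model-bytes", server))
    print(upload_object_deduplicated(b"3d-model-bytes", server))  # second upload reuses the stored object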

Processing Examples

Additionally or alternatively, the systems and methods here may utilize different processing techniques. FIG. 19a shows an example process that is triggered by a user when she imports any 3D model file 111800 by any method, including but not limited to dragging it into an existing virtual scene UI. Such interaction may cause the model import process 290101 to locally start reading the model file 290102 into the memory 290108, 400114 of the creator's 290112 device 290107, 400102, 400122. This may allow further parsing 290103, conversion 290104 and processing 290109 of the model to customize it for the different target watchers 290110 it is later shown to. This process may be a completely local series of calculations which does not need a connection to the server 290105 or any other sources. This may enable the system to be used in offline scenarios and protect the user's IP, since no sensitive information may be passed to the backend. This may be useful when using complex CAD documents including sensitive product information to extract the 3D model data 290102 from. The final result of the import 290103, conversion 290104 and customization process 290109 can be uploaded to the server 290105 when an internet connection is available. The resulting model 290111 may be included in the creator's 290112 list of imported 3D models 290106.

With the methods described here, the performance of the process may rely on the creator's front-end device 290107, FIG. 30a, e.g. the computer 400122 the user is using to create the virtual scene, on which it is executed. Therefore, the method may decouple the process from the available networking speed, making the only possible bottleneck the hardware 290107, 400106 of the creator, e.g. the CPU 290113, 400112 and the available RAM 290108, 400114. Additionally, due to the decentralized processing on the clients' devices, the backend 290105 cannot become a bottleneck as a result of too many simultaneous requests. Any number of watchers 290110 can use the converter 290104 simultaneously without a waiting queue.

The example import process 290101, i.e. parsing 290103, conversion 290104 and simplification 290109, is performed in the background and does not block any user interactions with the system while running. This asynchronous pipeline 290101 allows importing multiple models 290102 simultaneously. The progress of each import process 290101 can be visualized and returned to the creator 290112 as feedback while the creator continues to work on the virtual project.

During an example import process 290101 the model 290102 can be customized 290109 for the different target devices as described in FIG. 20a. During customization 290109, textures may be resized 300107 as described in FIG. 20a to be usable on mobile devices 400102. Furthermore, as a preparation for real-time mesh simplification on the client rendering the virtual scene, the model mesh may be brought into the correct order so that the mesh simplification algorithm 290109 can be performed in real time while the virtual scene is loaded in the player. The process 290109 may be executed during the creation process to allow adjusting the rendering quality to the performance of the watcher's device 290110, FIG. 30a.

In some examples, when rendering 3D content 300101 on mobile devices 300102, FIG. 30a, such as smartphones 400102, the hardware may be limited; for example, they may have limited graphics processing (GPU) 300103, 400118 capabilities. Simultaneously loading many highly detailed 3D models 300101 with large texture maps 300104 which are targeted for desktop GPUs 400118 may not be feasible on mobile GPUs 300103, 400118. Future developments may help with these situations, allowing processing to take place on any of various devices including mobile devices.

FIG. 20a illustrates an example using an automatic process to reduce the detailed 3D models 300101 to simpler models, where the originals might include hundreds of thousands of vertices and meshes. The example method may reduce this number automatically when the model 300101 is loaded on a device 300102, 400102, 400122 with a mobile GPU 300103, 400118, e.g. a smartphone 400102. Objects which include multiple meshes can be merged into a single mesh during this customization process to further reduce the workload for the GPU 300103, 400118.

Additionally, the original high-resolution texture maps 300104 may automatically be shrunk down to a second texture 300105 with reduced size which may consume less memory and load faster into the mobile GPU 300103, 400118 memory than the original texture arrangement 300104.

Such a combination of mesh reduction 300108 and texture reduction 300107 may be applied on the mobile device 300102, 400102 itself the first time the model 300101 is loaded, and the simplified version 300106 is then cached internally on the device 300102, 400114 for reuse when the model 300101 is requested a second time. This may improve the loading time significantly the second time the customized model 300106 is loaded, in comparison to loading the original model 300101.
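A hedged sketch of this first-load-then-cache behavior follows; the callbacks, cache location and pickle-based storage are illustrative assumptions, not the system's actual implementation.

    import os
    import pickle

    _CACHE_DIR = ".model_cache"   # illustrative on-device cache location

    def load_model_customized(model_id, load_original, simplify_mesh, shrink_texture):
        """First load: simplify the mesh and shrink the texture, then cache the result
        on the device; subsequent loads reuse the cached simplified version."""
        os.makedirs(_CACHE_DIR, exist_ok=True)
        cache_path = os.path.join(_CACHE_DIR, f"{model_id}.simplified")
        if os.path.exists(cache_path):
            with open(cache_path, "rb") as f:
                return pickle.load(f)                 # fast path: cached simplified model
        model = load_original(model_id)               # slow path: full-resolution model
        model["mesh"] = simplify_mesh(model["mesh"])
        model["texture"] = shrink_texture(model["texture"])
        with open(cache_path, "wb") as f:
            pickle.dump(model, f)
        return model

    # Example with dummy callbacks: the mesh is halved and the texture halved in each dimension.
    model = load_model_customized(
        "chair",
        load_original=lambda mid: {"mesh": list(range(100)), "texture": [[0] * 64 for _ in range(64)]},
        simplify_mesh=lambda mesh: mesh[::2],
        shrink_texture=lambda tex: [row[::2] for row in tex[::2]],
    )
    print(len(model["mesh"]), len(model["texture"]))  # 50 32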

Rotation Examples

Additionally or alternatively, the systems and methods here may support various display rotation techniques. FIG. 21a is a diagram that describes an example method to apply a user's rotation to videos, including but not limited to fully spherical videos, instead of the original rotation the camera 310102 400108 had when it took the image 310103. In some embodiments, multiple images can be used as long as they can be sorted in a consecutive order. Those multiple, consecutive images can come from one or multiple sources. Each image 310103 may be processed by a visual odometry system 310106 340102, resulting in the camera pose 310108. Odometry may refer to the estimation of positional change over time based on any number of data sources including images, motion sensors, known location anchors, updated location information, etc. The odometry subsystem may be used to inform the systems and methods here to estimate camera and object positions as well as update and estimate positions relative to one another and to the camera systems. Odometry may also refer generally to measuring distances traveled over time by any object or camera.

Augmented Reality Using Camera Images

As explained, augmented reality (AR) may utilize images captured by a camera and import computer-generated graphics into the camera images (i.e., computer-generated digital graphics other than the camera image, which is technically computer generated by the camera itself).

The camera pose 310108 may include the translation and rotation 310110 the camera 310102 400108 had when it took the image 310103. Knowing the camera rotation 310110, the image 310103 may be processed such that the rotation 310110 is removed from the image 310103. For the resulting image 310112, no matter how the camera was oriented, all images are afterwards directed in the same direction. The user 310104 can look at this video with a technical device, including but not limited to virtual reality headsets 400300. The user 310104 can rotate and decide which part of a video is to be shown, independently from the camera rotation 310110. The user's rotation 310114 can be applied to the image 310112 from which the rotation was removed. This may result in an image 310116 that is oriented in the same or a similar way as the user 310104 is oriented. The user's rotation 310114 and the field of view determine the image 310118 on a screen, which is typically only a part of the full image.
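The sketch below illustrates the two steps described above, removing the camera's rotation and then applying the user's rotation, for the yaw-only case; a full implementation would use the complete 3D pose from the odometry system, and the names here are illustrative.

    import numpy as np

    def rotation_yaw(yaw_rad):
        """Rotation matrix about the vertical axis (yaw only, for brevity)."""
        c, s = np.cos(yaw_rad), np.sin(yaw_rad)
        return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

    def view_direction(camera_yaw_rad, user_yaw_rad, direction_in_camera):
        """Map a viewing direction from camera coordinates into world coordinates
        (removing the camera's rotation), then express it in the user's frame
        (applying the user's rotation instead)."""
        world_dir = rotation_yaw(camera_yaw_rad) @ direction_in_camera   # undo camera rotation
        return rotation_yaw(-user_yaw_rad) @ world_dir                   # apply user's rotation

    # Example: the camera was rotated 90 deg when capturing; the user looks straight ahead.
    d = view_direction(np.deg2rad(90.0), 0.0, np.array([0.0, 1.0, 0.0]))
    print(np.round(d, 3))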

FIG. 21b illustrates an exemplary situation where the camera 310206 and the user 310208 are rotated in the same direction. Outside of virtual reality, this is the common situation for videos and images. In this situation, a user 310208 cannot rotate. Instead, she adopts the camera's rotation, which means the user 310208 always looks in the same direction as the camera when it shot the frame 310212 the user 310208 currently sees. For panoramic or spherical videos, the user usually only sees a subset of the video 310212 at each point in time because humans do not have a 360° field of view. Only objects inside the field of view 310218 of both the camera 310206 and the user 310208 may be seen in the displayed image 310212. In the provided example, the cube 310202 is outside of the field of view 310218, while the photo 310214 is inside of the field of view 310218. Therefore, the user sees the photo 310210 on his display 310212 while he does not see the cube 310216.

FIG. 21c illustrates an exemplary situation where the camera 310306 is rotated differently than the user 310308. Especially in virtual reality scenarios, it may be desirable for the user 310308 to be able to rotate independently of the camera 310306. In this example, the environment the camera 310306 is in includes two objects: a cube 310302 and a photo 310304. The user 310308 in the example sees the projected cube 310310 on his display 310312 but no projection of the photo 310304, although the camera 310306 is oriented in the direction of the photo 310304.

FIG. 21d illustrates an exemplary situation that makes clear how stabilization is beneficial. A cameraman 310404 is usually affected by unwanted motions, including but not limited to up and down movements of handheld cameras 400108 while walking. When a cameraman 310404 captures something, e.g. a person, those unwanted movements can affect the image he captures. This can result in movements in the captured images 310408 310410. The presented method will reduce or even eliminate potential unwanted movements in the captured images 310402. A virtual camera corresponds to the user's view, meaning that the virtual camera will follow the user's movements. The virtual camera can be decoupled from the physical camera, which means that it does not follow the movements of the physical camera. Decoupling the virtual camera from the physical camera can be achieved by applying the position and rotation provided by the odometry system to the virtual camera. The virtual camera determines which part of the image is projected onto the display 310312.

Depth Examples

In AR situations where cameras are used to capture an initial image, a depth value for each pixel may be useful in constructing an AR scene. Additionally or alternatively, the systems and methods here may calculate and utilize various depths in different ways. FIG. 22a shows an example depth estimation 320112 of each pixel in a stereoscopic panoramic image 320114, which may be created by a spherical or panorama camera including, but not limited to, the depiction in FIG. 29a. The depth with respect to a camera 320106, 400108, as illustrated in FIG. 30a, can be calculated by using at least two panoramic images 320102, 320104 that share the same features of a scene and thus can be used for depth estimation, including but not limited to optical flow, together with a camera 320106, FIG. 30a, 400108 that is facing toward each panoramic image 320110, 320108. In one non-limiting example, depth estimation may be determined based on pixel triangulation of the same feature in the two panoramic images and a third point such as camera position and/or height.
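As one hedged illustration of triangulating the same feature seen from two positions, the sketch below intersects two observation rays (camera center plus unit direction) by finding their closest approach; the midpoint gives a 3D point and hence a depth estimate. The inputs, names, and the 10 cm baseline are assumptions for illustration, not values from the source.

    import numpy as np

    def triangulate_midpoint(c1, d1, c2, d2):
        """c1, c2: camera centers; d1, d2: unit ray directions (3-vectors)."""
        # Solve for scalars t1, t2 minimizing |c1 + t1*d1 - (c2 + t2*d2)|^2.
        A = np.stack([d1, -d2], axis=1)          # 3x2 system
        b = c2 - c1
        t, *_ = np.linalg.lstsq(A, b, rcond=None)
        p1 = c1 + t[0] * d1
        p2 = c2 + t[1] * d2
        point = 0.5 * (p1 + p2)                  # triangulated 3D point
        depth_from_c1 = np.linalg.norm(point - c1)
        return point, depth_from_c1

    # Usage with placeholder values (10 cm baseline between the two views).
    c1 = np.array([0.0, 0.0, 0.0]); c2 = np.array([0.1, 0.0, 0.0])
    d1 = np.array([0.0, 0.0, 1.0])
    d2 = np.array([-0.05, 0.0, 1.0]); d2 /= np.linalg.norm(d2)
    print(triangulate_midpoint(c1, d1, c2, d2))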

Optical flow may refer to a more complicated algorithm beyond triangulation of pixels and may be further improved using other sensor data. For example, if the camera was moved or two frames were moved, the depth may be estimated using the movement and distance estimations.

Sensor data may be used in these calculations to augment the pixel triangulation.

The extracted depth information 320102 may be used to render the panoramic scene 320114 in a stereoscopic device 320122, including but not limited to devices described in FIG. 30d and FIG. 30e, with a correct sense of depth using the extracted depth information 320102 instead of rendering the panoramic images 320102, 320104 side by side on both eyes of the user. Such example embodiments may enable the user to rotate his or her head arbitrarily without destroying the immersion of 3D depth of the scene 320114. They additionally enable the user to move the point of view, the head of the user 320122, in a low range and this way allow a certain degree of positional movement in the reconstructed scene 320114.

Enhancement Examples

Additionally or alternatively, the systems and methods here may support display enhancement of Holos. FIG. 23a shows how a video can be enhanced by additional example elements. A camera 330108 400108 captures its surrounding environment and creates a video showing the surroundings. As an example, the surroundings could contain a cube 330106. One exemplary frame 330102 of the video could show this cube 330104. If the camera 330108 400108 moves relative to the cube 330106, the cube 330112 in the frame 330110 should be at a different position compared to the cube 330104 in the frame 330102. The frame 330110 can be viewed on a technical device 400106 400300. Systems and methods here can position additional elements in a three-dimensional space that is afterwards shown as if it were part of the video. As an example, a speech bubble 330120 can be positioned above a cube 330104. When the original camera moves, the cube 330104 in frame 330114 could become the cube 330112 in frame 330122 in the video. In such examples, the cube moved according to the movement of the camera and its position relative to the camera. This method can be used to calculate how the camera moves, so the additional elements can also move accordingly. In the example, the speech bubble 330124 would have a different position relative to the user 330118 for the second frame 330122, in contrast to the speech bubble 330116 that is used for the first frame 330114. In such examples, not only would the speech bubble 330128 stay on top of the cube, or any other designated object, it can even change its displayed size in a perspectively correct way when the camera moves closer or farther away.

Object and Camera Tracking Examples

In some AR examples, a camera may change its field of view by panning, zooming, or otherwise moving in the 3D space. In some examples, objects within a camera's field of view may move independently of the camera, and move in and out of the camera's field of view. For example, a bird may fly into view and out again behind a tree. In some example embodiments, it may be desired to track objects that are stationary or move relative to the camera used to capture the scene image. By using the systems and methods below, objects may be tracked and the camera position may be tracked. In some examples, both of these position trackings may be combined when both an object and camera move independently of one another.

Additionally or alternatively, the systems and methods here may support tracking images and objects within a scene. FIG. 24a illustrates an example overall tracking system. Images 340120 taken with a camera 400108 may be used for this method. These can be provided by either a live image stream from a camera 400108 or through a pre-recorded video. An example Odometry Tracker feature 340102 may run on a computing device 400106 and calculate, for every image, the pose the camera had when it captured the image. The pose of the latest image may then be used to place the virtual camera 340104 in the scene 340110 such that the movement in the scene 340110 corresponds to the movement of the camera 340104 in the real world. The scene may be displayed by a technical device 400106. In such examples, the Object Tracker 340106 may detect and track arbitrary objects in the camera images 340120 and place those objects in the scene 340110 accordingly. Several instances 340122 of the Object Tracker 340106 may run in parallel, where each instance is searching for at least one object. Some examples of objects that can be tracked include, but are not limited to: point clouds 340112, 3D objects 340114, 2D images 340116 and artificial markers 340118.

The Odometry Tracker 340102 can internally create a map of depth hypotheses for a subset of pixels in a previous image and use this map to warp pixel coordinates onto a new image. It can calculate an error value for the current pose between the two images by comparing the pixel colors in both images. It can optimize the pose between the images in an iterative process to find a warp that corresponds to a minimum error value. The system can use the information of the estimated pose to calculate a new depth measurement for each pixel that is visible in both images. It can update the depth hypotheses in the previous image with information from the new measurement through the use of a filter method. In some examples, additionally or alternatively, the Odometry Tracker 340102 can select tracked frames as Keyframes and insert them into a connected graph.
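The sketch below illustrates the warping and photometric-error idea described above under simplifying assumptions: a pinhole camera model, per-pixel depth hypotheses for every pixel, nearest-neighbour intensity lookup, and a candidate pose given as a rotation and translation. It is an illustrative stand-in, not the tracker's actual implementation; in use, the pose (R, t) would be varied iteratively to minimize the returned error.

    import numpy as np

    def warp_and_error(ref_img, new_img, depth, K, R, t):
        """ref_img, new_img: HxW grayscale arrays; depth: HxW depth hypotheses;
        K: 3x3 intrinsics; (R, t): candidate pose of the new camera w.r.t. the
        reference camera. Returns the mean squared photometric residual."""
        H, W = ref_img.shape
        ys, xs = np.mgrid[0:H, 0:W]
        pix = np.stack([xs, ys, np.ones_like(xs)], axis=-1).reshape(-1, 3).T  # 3xN
        # Back-project reference pixels to 3D using the depth hypotheses.
        pts = np.linalg.inv(K) @ pix * depth.reshape(-1)
        # Transform into the new camera frame and project.
        pts_new = R @ pts + t.reshape(3, 1)
        proj = K @ pts_new
        u = proj[0] / proj[2]
        v = proj[1] / proj[2]
        # Keep only warps that land inside the new image (nearest-neighbour lookup).
        valid = (proj[2] > 0) & (u >= 0) & (u < W) & (v >= 0) & (v < H)
        ui, vi = u[valid].astype(int), v[valid].astype(int)
        residual = (ref_img.reshape(-1)[valid].astype(np.float32)
                    - new_img[vi, ui].astype(np.float32))
        return np.mean(residual ** 2)   # error value for this candidate pose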

Keyframes may be chosen based on the pose and depth measurements estimated by the Odometry Tracker 340102. The first frame in a sequence of images is always the first Keyframe. New frames are tracked relative to a previous Keyframe. A new frame may be declared a Keyframe when the distance of its pose to the current Keyframe is above a certain threshold. A new frame may also be declared a Keyframe when its depth measurements deviate significantly from the depth hypotheses in the current Keyframe. Selection of keyframes allows the system to avoid computations on all frames, and thereby increase computer efficiency, decrease drain on processing resources, and provide faster as well as more accurate and clear images to display.
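A minimal sketch of the keyframe decision just described is shown below. The thresholds, the use of translation distance as the pose distance, and the mean relative depth deviation are assumptions chosen for illustration only.

    import numpy as np

    def should_become_keyframe(pose_translation, keyframe_translation,
                               frame_depth, keyframe_depth,
                               dist_threshold=0.5, depth_threshold=0.3):
        # Criterion 1: the frame has moved far enough from the current keyframe.
        moved_far = np.linalg.norm(pose_translation - keyframe_translation) > dist_threshold
        # Criterion 2: its depth measurements deviate strongly from the keyframe's
        # depth hypotheses (mean relative deviation over commonly valid pixels).
        valid = (frame_depth > 0) & (keyframe_depth > 0)
        rel_dev = (np.mean(np.abs(frame_depth[valid] - keyframe_depth[valid])
                           / keyframe_depth[valid]) if valid.any() else 0.0)
        return moved_far or (rel_dev > depth_threshold)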

The Odometry Tracker can perform pose graph optimization methods on this graph to find a globally optimal pose for each Keyframe. The Odometry Tracker 340102 can utilize the existing depth maps in this optimization to correct scale drift within the previously estimated poses of the Keyframes. This may be done by estimating similarity transforms between the Keyframes. In addition to the visual input, the Odometry Tracker 340102 can also process readings from additional sensors such as, but not limited to, a gyroscope, accelerometer, compass or GPS. The Odometry Tracker 340102 can combine the readings of the different sensors into factors and integrate them in a factor graph together with the error functions of the tracked images. It can marginalize these factor graphs into probability distributions which it might utilize as priors in the estimation of poses for new frames. Additionally, the Odometry Tracker 340102 can perform local optimization on the factor graph for poses that have not yet been marginalized.

FIG. 24b illustrates an example of how the method may benefit a user 340208. The different scenes 340202, 340204 and 340206 in the example are ordered chronologically in time. A user 340208 holds a device with a camera 340210 400102. The user 340208 wants to track an object 340214. There is an area 340212 in which objects can be detected. In some examples, objects that are not visible, obscured, or otherwise too far away cannot be detected at any given time. In the first scene 340202 the object 340214 is outside the detection area 340212. The system does not detect the object in this scene because it is not shown in the camera image. In the second scene 340204, the user 340208 moves further away such that the object 340214 can be detected. In the third scene 340206, the user 340208 moves further away still. The object 340214 is now so far away that it cannot be detected by the Object Tracker 340106, running on a computing device 400106, anymore.

In one example, a hologram of a user is placed in a chair; even if the camera pans away from the hologram of the user, the system keeps track of where the hologram of the person stays. Thus, when the camera field of view shows the chair again, the hologram is still shown in the same place.

Even though the object in the example cannot be detected anymore by the camera, or is otherwise outside the field of view of the camera, the Odometry Tracker 340102 may continue to function and track the camera position as well as the object. Because the Odometry Tracker may update the position of the camera 340210, the system may also estimate where the object 340214 is relative to the camera 340210, 400102 if it does not move, by maintaining a relative position and direction indicator between camera and object; in some embodiments, the system may also estimate the position of a moving object by using its last known position and motion over time. This may enable tracking of objects over an arbitrary distance after they are detected once, even though they may not be visible in the camera image the whole time. The system may track an object even if the object is obscured by another feature or the camera field of view moves off of the feature. The system thus allows the camera field of view to move off the target object and later come back to it, and it is still able to track its position.

FIG. 24c illustrates example ways the pose of detected objects may be calculated and transformed from the object space to the virtual space. The process of finding objects may be decoupled from the process of estimating the camera pose. While the camera pose 340310 may be calculated for every frame 340302 that was taken with a camera 400108, the calculation to determine the camera pose 340310 may take several frames until the calculations are completed. In that case, some frames may be ignored by the Object Tracker 340312. The Odometry Tracker 340308, 340102 may run on the main thread 340304 and estimate the camera pose in the virtual space 340310 each time a new frame 340302 is provided. This can directly be used to update the camera pose 340320 in the scene 340318 examples. Several background threads 340306 may run in parallel, where each thread has its own instance of the Object Tracker 340312. Once an instance of the Object Tracker 340312 detects an object, it may calculate its pose in the object space 340314. The translation and rotation may have to be transformed from the object space 340314 into the virtual world space 340316. In addition to translation and rotation, the correct scale of the object in the virtual world space 340316 has to be calculated. With these values, virtual elements can either be added or updated 340322 in the scene 340318. All calculations can be performed on a computing device 400106.
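A hedged sketch of chaining a detected object's pose from object space into the virtual world space via the tracked camera pose is given below. The 4x4 homogeneous-matrix convention, the uniform scale factor, and all values are assumptions for illustration.

    import numpy as np

    def to_homogeneous(R, t, scale=1.0):
        """Build a 4x4 transform from rotation R, translation t and uniform scale."""
        T = np.eye(4)
        T[:3, :3] = scale * R
        T[:3, 3] = t
        return T

    def object_pose_in_world(T_world_from_camera, T_camera_from_object):
        # Chain the transforms: world <- camera <- object.
        return T_world_from_camera @ T_camera_from_object

    # Usage with placeholder values: identity rotations, a camera 1.5 m up,
    # an object detected 2 m in front of the camera at half scale.
    T_wc = to_homogeneous(np.eye(3), np.array([0.0, 1.5, 0.0]))
    T_co = to_homogeneous(np.eye(3), np.array([0.2, 0.0, 2.0]), scale=0.5)
    print(object_pose_in_world(T_wc, T_co))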

FIG. 25a illustrates an example odometry system 350112, which is performed on a computing device including, but not limited to, the components 400106 displayed in FIG. 30a, and which starts tracking 350104 as soon as a new image 350102 is available, delivered by a device including but not limited to devices described in FIG. 29a, FIG. 30a, 400108. If the visual odometry successfully tracked 350106 the incoming image, it may provide the virtual pose of the image, including but not limited to position and orientation. The visual odometry may trigger the visual search thread 350116, which may run in the background asynchronously, and pass the image 350102 on to it 350116. Simultaneously, or closely thereafter, the odometry system 350112 may continue with the tracking 350104. In case of losing the tracking, the odometry system 350112 may wait for a new image 350102 to arrive and begin with the tracking again.

In some examples, it may be useful to identify an object based on its shape, color, or other visual attributes, and then use that identification to track the object. Object detection using a visual search engine 350116 may be processed in multiple ways, e.g. directly on the device, locally, including but not limited to devices described in FIG. 29a, FIG. 30a, 400108, in which case no internet connection may be needed, or in a networked arrangement which may provide more resources and thereby better performance. The visual search thread 350116 may notify the main thread 350112 after it finishes processing the image. If no objects were found 350114, the odometry tracker 350112 340102 may continue the tracking 350104. Otherwise, it may check whether the odometry tracker 350112 340102 has lost the tracking since the last visual search was triggered. If it lost the tracking 350108, the result of the visual search 350116 may be ignored. Otherwise, the position of the detected objects on the current image may be updated 350110. Either way, the main thread 350112 may continue with the tracking 350104.

In various examples, the number of detected objects can vary. It is possible that hundreds of objects may be detected with the same label on an image. Assuming that those objects are stationary, they can still be positioned correctly using the combination of the visual odometry and the visual search.

FIG. 25b shows an illustration of one possible use case of the systems and methods here. In the example, a camera 350206, including but not limited to FIG. 29a, FIG. 30a, 400108, is moving along a path 350202 while it tracks the incoming image 350204, which is delivered by a camera including but not limited to devices described in FIG. 29a, FIG. 30a, 400108. In some examples, simultaneously, or at nearly the same time, the image 350204 is processed by the visual search engine 350116 on the background thread to detect objects. Once the visual search engine 350116 detects objects, it may notify the main thread 350112. The main thread 350112 may then update the absolute pose of the objects found using the result given by the background thread.

FIG. 25c shows an illustration of a second possible use case of example systems and methods. In the example, a camera 350306, including but not limited to devices described in FIG. 29a, FIG. 30a, 400108, is moving along the path 350302 while it tracks the incoming image 350304, which is delivered by a camera including but not limited to devices described in FIG. 29a, FIG. 30a, 400108. The visual odometry 350112 determines whether or not the incoming image 350304 is a keyframe 350308. A keyframe 350308 in the visual odometry system 350112 is a frame with a depth map and a position that is used by non-keyframe frames to calculate their relative position to a keyframe 350308 and further refine the keyframe's depth map. Keyframes are used as a reference for all following frames until a new keyframe is created. The frames between two Keyframes improve the accuracy of the last keyframe by combining the information collected in each individual frame into the Keyframe over time.

The visual search engine 350116 may then process the incoming image to find the keyframes 350308 previously defined. The visual search engine 350116 has a set of images with known positions, which means that the absolute position is known for keyframes 350308 that are found by the visual search engine 350116. The visual search engine thread 350116 may notify the main thread as soon as it finds the keyframe. Using this result, the main thread can calculate the camera's absolute pose, because the visual odometry alone can only provide the virtual pose of each keyframe, and that pose is often not stable enough over time.

FIG. 26a shows an illustration 360100 of an example workflow of 3D marker generation using a geometry mesh, which is performed on a computing device including but not limited to 400106, FIG. 29c. The 3D scanner application 360110 may generate, in some embodiments continuously, 360122 a point cloud 360124 in the background thread 360120 using an estimated depth map 360114 of a keyframe 360112 in combination with the keyframe's pose. The depth values of pixels in a keyframe are estimated 360113 using, for example, optical flow; this way a depth map 360114 can be built based on this depth estimation. Estimating depth values and creating a depth map can be performed by the Odometry Tracker 340102.

In some example embodiments, alternatively or additionally, the point cloud 360124 used to generate 360132 a 3D mesh 360133 of the scene may be stored in a voxel tree to improve the performance, which may be done in a separate thread. The point cloud 360124 generated using a depth map 360113 may still contain noise, which may be filtered before the filtered point cloud 360125 can be used to generate a mesh 360133. A voxel is a volumetric pixel in a three-dimensional space. Using a voxel tree may speed up this filtering process 360126 in certain embodiments. The voxel tree is a data structure (octree) designed to work with spatial data; therefore it can provide better performance for processing 3D points from the point cloud 360124. It can filter the point cloud 360124 360126 by smoothing and removing the noise points. Using the voxel tree to filter the point cloud 360124 360126 can reduce the number of noise points and therefore optimize the point cloud to improve performance for further computations. Furthermore, it can reduce the error of the resulting 3D mesh 360133.
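As a simplified stand-in for the voxel-based filtering described above (a real octree would subdivide space adaptively), the sketch below snaps points into a flat voxel grid, averages the points inside each voxel, and drops voxels holding too few points as likely noise. The voxel size and minimum-point count are illustrative assumptions.

    import numpy as np
    from collections import defaultdict

    def voxel_filter(points: np.ndarray, voxel_size: float = 0.05, min_points: int = 3):
        """points: Nx3 array. Returns a smaller, denoised Mx3 array."""
        buckets = defaultdict(list)
        for p in points:
            key = tuple(np.floor(p / voxel_size).astype(int))  # voxel index
            buckets[key].append(p)
        # Keep one averaged point per sufficiently populated voxel.
        filtered = [np.mean(pts, axis=0) for pts in buckets.values()
                    if len(pts) >= min_points]
        return np.array(filtered) if filtered else np.empty((0, 3))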

The 3D mesh 360133 generation process uses a filtered 360125 or unfiltered point cloud 360124 and keyframe poses as an input source. As a first iteration of the process, the system can compute a normal vector for each point in the input point cloud. The normal vector for each point is calculated based on its neighboring points. As a next iteration, the 3D mesh 360133 generation system can check and orient the normal vector of each point toward the camera pose of the keyframe that the point belongs to.
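The two steps above can be sketched as follows: estimate a normal per point from its neighbours via a covariance (PCA) analysis, then flip it so it faces the camera position of the keyframe that observed the point. The neighbourhood size and the brute-force neighbour search are illustrative choices, not the system's actual implementation.

    import numpy as np

    def estimate_oriented_normals(points, camera_position, k=8):
        """points: Nx3 array; camera_position: 3-vector; returns Nx3 unit normals."""
        normals = np.zeros_like(points, dtype=float)
        for i, p in enumerate(points):
            # k nearest neighbours by brute force (fine for a sketch).
            dists = np.linalg.norm(points - p, axis=1)
            nbrs = points[np.argsort(dists)[:k]]
            # The normal is the eigenvector of the smallest covariance eigenvalue.
            cov = np.cov((nbrs - nbrs.mean(axis=0)).T)
            _, vecs = np.linalg.eigh(cov)
            n = vecs[:, 0]
            # Orient the normal toward the camera that observed the point.
            if np.dot(camera_position - p, n) < 0:
                n = -n
            normals[i] = n / np.linalg.norm(n)
        return normals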

By taking the point cloud with normal vectors as an input, the system can perform a 3D mesh reconstruction. The system can utilize depth and number-of-samples parameters to control the performance of the system and the accuracy and quality of details of the resulting 3D mesh 360133. The system can utilize the data structure properties of the resulting 3D mesh 360133 to trim the resulting 3D mesh 360133 to achieve a better quality 3D model. Furthermore, the system can apply a texture 360134 to the generated 3D mesh 360133.

In some embodiments, the system may recognize that the 3D mesh built of an object has erroneous points associated with it. In an example, the object is a box but there is one point that is far from the box and, if considered part of the object, would make it not a box. The system may use algorithms based on known features, in this example knowing a box has six sides of equal dimensions, to identify erroneous points and trim those points away from the 3D mesh. The algorithms may compare points to known relative distances, or compare them to input shapes. Points included in error may be trimmed away from the 3D mesh.

In some example embodiments, alternatively or additionally, a texture projection 360134 may be used as an optional step with the generated mesh 360133 to give the 3D mesh a particular look. For example, the system creates a 3D mesh of a house and projects camera pictures of the house onto it. In other words, the mesh may have a corresponding keyframe 360112 as its texture. Various options may be given for the texture projection 360134: projecting a single texture, or projecting and combining multiple textures on one mesh. A single texture projection 360134 may project the latest keyframe image onto the 3D mesh. In some cases, the biggest part of the mesh is left untextured, because it is not covered by the keyframe. In such examples, when the multi-texture projection 360134 is applied, multiple keyframe images may be projected 360134 onto the 3D mesh, and in areas with overlapping keyframes the different keyframes may be merged into a single texture. Merging of multiple keyframes into a single texture is done by weighting the keyframe's distance to the mesh surface and the keyframe's angle relative to the mesh surface. This process may provide a better quality of the textured mesh 360138 and decrease the untextured region of the textured mesh. When multiple keyframes are used to create the texture, the best suitable keyframe for each part of the mesh's texture may be used to improve the overall texture quality. The result can be further enhanced by not only using the best suitable keyframe for each part of the mesh's texture, but instead using a combination of keyframes to create the texture for that part of the mesh's texture.
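A minimal sketch of the weighting idea described above is given below: when several keyframes cover the same surface point, blend their colour samples with weights derived from distance to the surface and viewing angle. The exact weighting function is an assumption, not the source's formula.

    import numpy as np

    def blend_keyframe_colors(colors, distances, cos_angles):
        """colors: Kx3 colour samples of one surface point from K keyframes;
        distances: K distances keyframe -> surface point; cos_angles: K cosines
        between each keyframe's view direction and the surface normal."""
        colors = np.asarray(colors, dtype=float)
        # Closer keyframes and more frontal views get larger weights.
        w = (1.0 / (np.asarray(distances) + 1e-6)) * np.clip(cos_angles, 0.0, 1.0)
        if w.sum() == 0:
            return colors[0]
        return (w[:, None] * colors).sum(axis=0) / w.sum()

    # Usage with placeholder samples: a close, frontal keyframe dominates.
    print(blend_keyframe_colors([[200, 10, 10], [180, 30, 30]], [1.0, 2.0], [0.9, 0.4]))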

In these examples, the user can save 360136 this textured mesh 360138 into a 3D model, together with its corresponding keyframe 360112, on the device, including but not limited to devices described in FIG. 30a, or discard 360135 the mesh. The keyframe 360112 may be used to extract keypoints, which may be needed for the tracking process in the virtual scene when it is used as an augmented reality overlay over the physical scene.

In certain examples, in the browser 360140, the generated textured mesh 360138 may be used either as a usual 3D model in a virtual scene or as a 3D marker 360144 in the virtual scene when it is used as an augmented reality overlay over the physical scene. From a user's point of view, annotating 360142 a textured mesh 360138 as a 3D marker 360144 for a virtual scene may be more efficient and user friendly than using a point cloud 360124 for the annotation process.

FIG. 27a shows an example usage of mesh generation in real time, which may be performed on a computing device including but not limited to 400106. During the monocular tracking example 370102, each time a new keyframe 370103 is found, the depth information of the new keyframe is estimated 370104 by the odometry system, and from this depth estimation 370104 a depth map 370105 may be reconstructed and stored in the memory 400114. The depth map 370105 may be used as the input for the point cloud generation process 370106 and a corresponding point cloud 370107 may be created. This point cloud 370107 may be used in the final step, the geometry mesh generation step 370108, so that a 3D mesh 370109 may be reconstructed based on the point cloud input 370107.

The generated mesh 370108 may be used for different usage scenarios which often require a fast or real-time reconstruction of the scene while the user is moving through the scene. For a realistic augmentation of virtual objects in the physical scene, the 3D mesh 370109 may be used in any combination of the following ways:

The 3D mesh 370109 can be used as an invisible layer during the tracking for a physics simulation 370110, for example letting a virtual object move on top of the surface of the geometry mesh 370109.

The real-time generated 3D mesh 370109 can be used for correct occlusion 370112 of virtual objects behind physical objects. From the user's point of view, this generated 3D mesh has the same shape and position as an existing physical object. The 3D mesh can be invisible, meaning a user won't see the mesh, but it can be used to render other virtual objects differently. Virtual objects, or parts of them, that from the user's point of view are behind the invisible 3D mesh won't get rendered. From the user's perspective, this looks as if a physical object occludes the virtual object. For example, virtual objects can be placed behind a physical wall, so that when the user moves around the physical scene the virtual object is only visible if the physical wall would not occlude it.

The 3D mesh 370109 can be used for illumination 370114, since the lighting of the virtual objects has to be applied correctly to the physical scene and, on the other hand, the lighting conditions of the physical scene have to be applied to the virtual objects. For example, a virtual object which is placed under a physical table has to be illuminated differently than an object placed on top of the table.

The 3D mesh 370109 can be used for shadow casting 370116 of virtual objects on physical objects and also shadow casting of physical objects on virtual objects. The virtual object has to receive and cast shadows realistically, interacting not only with the other virtual objects but also with the physical scene, which requires the reconstructed 3D mesh 370109 as the representation of the physical scene.

Filtered Examples

FIG. 28a depicts the abstract concept of 360° image fusion using a computing device 400106. 360° images can be created with a spherical camera 390102 and the fusion can be performed on a computing device 400106. Multiple 360° images 380102 can be merged into one improved image 380104 by using a filter. These images 380102 may be taken from the same position, which can be achieved for example with a 360° camera on a tripod. In such examples, a minimum of two 360° images can be used, and the more input images are used, the better the resulting filtered image can be. The filter used to merge the images can be a median or high-dynamic-range filter, although additional filters would be possible. The median filter can be used to remove moving objects like people or reduce image noise from bad lighting conditions. The high-dynamic-range filter creates one HDR 360° image from multiple 360° images.

FIG. 28b shows the fusion of multiple 360° images using the median filter. 360° images can be created with a spherical camera 390102 and the fusion can be performed on a computing device 400106. The shown 360° images 380221 contain people 380206 who moved through the scene while the images were taken, and changing noise 380208 from bad lighting conditions. The dots 380204 indicate that more than the depicted two 360° images were taken from the same position and used as input for the filter. The median filter may compare the individual pixels of each image and recognize the areas that repeatedly stay the same, while removing the areas that change depending on the image. The resulting 360° image 380210 contains only the static background without the noise. While this example uses people, other moving objects could also be removed.
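A minimal sketch of this per-pixel median fusion is shown below, assuming the input 360° images are already aligned (taken from the same tripod position) and loaded as equally sized arrays; loading and alignment are outside the sketch.

    import numpy as np

    def median_fuse(images):
        """images: list of HxWx3 uint8 arrays taken from the same position.
        Returns one HxWx3 image; moving objects and per-shot noise, which differ
        from image to image, are suppressed by the per-pixel median."""
        stack = np.stack(images, axis=0).astype(np.float32)  # N x H x W x 3
        fused = np.median(stack, axis=0)
        return fused.astype(np.uint8)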

FIG. 28c shows example fusion of multiple 360° images using the high-dynamic-range filter. 360° images are created with a spherical camera 390102 and the fusion is performed on a computing device 400106. The top row depicts the original 360° images, including a 360° image with shadow detail but less highlight information 380302 and a 360° image with highlight detail but less shadow information 380304. The dots 380306 indicate that more than the depicted two 360° images were taken from the same position, with changing exposure, and used as input for the filter. The high-dynamic-range filter uses the exposure data of the different 360° images to combine them into a new 360° image containing the full dynamic range 380308.

FIG. 29a depicts a spherical or panorama camera 390100 for shooting images with a wide field of view 390106 of 360° or less. FIG. 29a is one exemplary illustration of a spherical or panorama camera 390100 which will be used as a representation for the described camera. A spherical or panorama camera 390100 has at least two lenses 390104, each covering a different field of view of the total possible 360°. To obtain the final panoramic or spherical image, the image 390108 of each lens 390104 may be stitched automatically within the device 390102 software.

FIG. 29b shows a rig construction 390200 example with more than one camera 390202, each with one or more lenses 390204, attached to a mobile architecture 390206. FIG. 29b is one exemplary illustration of a rig architecture 390200 for shooting wide-range spherical or panoramic images with a field of view 390208 of 360° or less. Each camera 390202 in the example simultaneously shoots an image 390210. The images 390210 of each camera may be stitched together with the corresponding images 390210 of the other cameras 390202 to cover the whole field of view 390208.

FIG. 29c illustrates a 3D scanner 390302 for content creation in accordance with certain embodiments, which comprises one or more camera lenses 390306 and/or one or more other sensors 390304. The other sensors 390304 can include, but are not limited to, any combination of infrared sensors, lasers, sonic, radar, camera, GPS or other distance measuring systems. The range 390308 covered by the scanner 390302 may be variable in any direction, depending on where the lenses 390306 and sensors 390304 are placed on the device 390302. This includes, but is not limited to, 3D scanning a complete room that surrounds the 3D scanner and scanning a single discrete object. The outcome of the 3D scanning process using a corresponding device 390302 may be a model comprising a set of points in 3D space as well as potential additional information associated with those points, such as various geometric entities and/or color. Models that originate from a 3D scanning process using a corresponding device 390302 may act as an input resource to the systems and methods here, by which they are used, processed and/or enhanced.

FIG. 29d illustrates an example graphics tablet for digital drawing 390402, used for content creation in accordance with the systems and methods here, comprising a physical surface 390408 as well as one or more physical input devices 390404 390406. The physical surface 390408 may detect the input generated by the physical input devices 390404 390406 and translate it into digital graphical information using a processor, random access memory and potential additional components for computation, or transmit the detected input information to a separate computing device for the generation of digital graphical information. The physical input devices for generating input information on the physical surface 390408 can include, but are not limited to, the user's hand 390406 and a stylus 390404, i.e., a pen-like drawing apparatus.

Additionally or alternatively, the systems and methods here may utilize various computing devices. FIG. 30a illustrates a computing device 400106, which can be any kind of mobile device 400102, including but not limited to smartphones and tablets, or a stationary device, including but not limited to personal computers 400122. A mobile device 400102 is light and small enough so that it can be carried around 400104. The computing device 400106 can comprise several components. Those components can either be included in a single all-in-one device or spread over multiple devices that work together. A computing device 400106 is a technical device that needs an energy source 400120 to function. For computation, several hardware components may be used: a CPU 400112 and RAM 400114 to perform calculations and some kind of storage 400116 to store software and other files. A GPU 400118 can be used to perform the visual processing. A display 400110 can show information and other arbitrary things to the user. Some displays 400110 can even be used to interact with the computing device 400106, including but not limited to touch input. A camera 400108 can be used to capture the surroundings.

Additionally or alternatively, the systems and methods here may utilize various display devices. FIG. 30b describes a non-limiting example head-mounted display device 400200 which comprises two lenses 400202 for the user's eyes and an area or apparatus 400208 where a smartphone or similar computing/display device is placed and which is then brought into a position 400206 so that the display of the device is in front of the lenses 400202 and the user has a clear look at the two display halves. Additionally, there can be an adequate left-out area 400208 on the backside 400204 of the head-mounted device 400200 so that the camera of the smart device may transmit a live-stream view while placed in the head-mounted device 400200. The integrated components which, among others, can be contained in the head-mounted device 400200 are described in detail in FIG. 30a.

FIG. 30c describes a head-mounted device 400300 which comprises two lenses 400304 (compare 400202 in FIG. 30b) and a display 400302 which is placed relative to the lenses 400304 so that it is visible to the user when the device 400300 is mounted on the user's head. The content displayed on the screen 400302 of the device is provided through a connection 400306 to an external source including, but not limited to, a desktop computer as described in FIG. 30a, which provides the computational power to render the virtual scenes. The additional components which, among others, can be contained in the head-mounted device 400300 are described in detail in FIG. 30a.

FIG. 30d describes an example head-mounted device 400400 which comprises a mounting system 400402 where a smart device 400404, like a mobile phone, may be placed. The example includes a passive display 400408 which depicts the content shown on the screen 400404 by reflecting it into the user's eyes 400406. In some examples, this passive display can be semitransparent to show the virtual content in combination with the physical world as an Augmented Reality overlay. The additional components which, among others, can be contained in the head-mounted device 400400 are described in detail in FIG. 30a.

FIG. 30e describes an example head-mounted device 400500 which comprises a projector 400502 mounted on the user's head and an onboard processing unit 400504 which allows all needed calculations and renderings to be done on the head-mounted device 400500 without the need of an external computing system. The projector in the example projects the displayed images either directly into the user's eyes 400506 or through a surface 400508 which can be semitransparent to show the virtual content in combination with the physical world as an Augmented Reality overlay. The additional components which, among others, can be contained in the head-mounted device 400500 are described in detail in FIG. 30a.

FIG. 30f describes an example head-mounted device 400600 which comprises a projector 400504 and an onboard processing unit 400502 which allows all needed calculations and renderings to be done on the head-mounted device 400600 without the need of an external computing system. The projector 400504 in the example may be used to project the digital content onto the physical scene 400506 so the user and other users can see the virtual content without the need to project the content directly into the user's eyes. The additional components which, among others, can be contained in the head-mounted device are described in detail in FIG. 30a.

Additionally or alternatively, FIG. 31a describes an augmentation system 410100 which comprises a smart device 410101, including but not limited to a smartphone or a tablet, a sensor component 410102 and a projector system 410106 which is attached to the smart device 410101, receives the images generated on this smart device 410101 and projects them onto a target projection surface 410104. Such surface 410104 is overlaid with the digital content from the smart device 410101. The sensors 410102 can include but are not limited to: accelerometer, gyroscope, compass, GPS and/or camera. The sensors 410102 may be used by the smart device 410101 to project the virtual content correctly aligned on the surface area 410104. The user's field of view 410103 may overlap with parts of the projector's field of view 410105 so that he or she can see the augmented content on top of the real environment 410104. Such augmented content can be, among others, captured spherical 360 image or video content which was recorded in the past and is now projected onto the same physical location where it was captured, to show the user the changes that happened to the physical location since the original recording. The correct parts of the spherical image are aligned using the built-in sensors 410102, or a manual alignment by the user is also possible.

FIG. 31b describes an augmentation system 410200 using a head-mounted device 410206 which comprises a projector 410201, sensors and a computing unit 410202. The sensors 410202 can include but are not limited to: accelerometer, gyroscope, compass, GPS and/or camera. The head-mounted device 410206 projects a virtual image on a physical scene 410204. This physical surface 410204 is overlaid with the aligned virtual content from the projector 410201 based on the projector's field of view 410205 and the projector's pose in the physical scene, calculated by the sensors 410202 of the head-mounted device 410206. Using a head-mounted device 410206 shown in FIG. 31b instead of an augmentation system 410100 as shown in FIG. 31a has benefits including but not limited to: the position of the head-mounted device 410206 relative to the user's head does not change because the user wears the head-mounted device 410206; the head-mounted system 410206 can be aligned such that the projector always projects its images in the direction where the user looks at the environment 410204; and the user has both hands free while using the head-mounted device 410206 and can nevertheless use the head-mounted device 410206 wherever the user wants to use it. The user's field of view 410203 overlaps with the projector's field of view 410205. Knowing the projector's 410201 position relative to the user's eyes enables the head-mounted device 410206 to project the augmented content in correct perspective. The augmented content is projected on top of the real environment 410204. The augmented content includes but is not limited to 360 images or videos recorded in the past. For this exemplary use case the augmented content would be projected onto the same physical location in the environment 410204 where it was captured originally, to show the user the changes that happened to the physical location since the augmented content was recorded. The augmented content can be aligned to the real environment 410204 automatically using built-in sensors 410202 or manually by the user.

FIG. 31c describes in detail how the user 410301 uses the computing device 410302 with the attached sensors 410102 410202 and projector 410303 to project the virtual content 410307 onto the physical surface 410306.

When capturing the content, including but not limited to 360 images and videos, that should later be used as virtual content to augment the physical environment 410306, the content can be enhanced with additional meta information. This additional meta information can include but is not limited to position and orientation. When sensors 410102 410202 are available, those can be used to gain this information. Otherwise, or in addition to using the sensors, meta information can be set manually. The meta information can either be saved together with the content or stored separately. This meta information can be used for the automatic alignment when the content should be projected on top of the physical environment 410306.

The computing device 410302 may calculate which part of the overall virtual scene 410304 is projected as the virtual content 410307 by the projector 410303 onto the physical surface 410306. The physical position and orientation of the device 410302 can be applied to a virtual camera in the virtual scene 410304 so that this virtual camera moves through the virtual scene 410304 in the same way the physical projector 410303 moves through the physical scene 410306. The field of view of the virtual camera uses the identical values as the field of view 410305 of the physical projector, so that the projected image 410307 matches on top of the physical surface 410306.
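A hedged sketch of driving the virtual camera from the projector's tracked pose and field of view follows, so that the rendered cut-out lines up with the physical surface. The standard OpenGL-style perspective matrix and the function names are assumptions for illustration.

    import numpy as np

    def perspective(fov_y_deg, aspect, near=0.1, far=100.0):
        """Standard perspective projection matrix (OpenGL convention)."""
        f = 1.0 / np.tan(np.deg2rad(fov_y_deg) / 2.0)
        return np.array([[f / aspect, 0, 0, 0],
                         [0, f, 0, 0],
                         [0, 0, (far + near) / (near - far), 2 * far * near / (near - far)],
                         [0, 0, -1, 0]])

    def virtual_camera_matrices(projector_pose_world, projector_fov_deg, aspect):
        """projector_pose_world: 4x4 pose of the physical projector from the
        device sensors. The virtual camera copies the projector's pose and its
        field of view, so the rendered image matches the projected one."""
        view = np.linalg.inv(projector_pose_world)   # world -> camera transform
        proj = perspective(projector_fov_deg, aspect)
        return view, proj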

The device's orientation and position can be calculated automatically using the sensors, including but not limited to the visual tracking system of the computing device 410301. The visual tracking system, as described in 350100, can use the features of the physical scene 410306 in combination with the other device sensors to calculate an absolute global position and align the virtual overlay 410307 with the physical scene 410306 by using the metadata of the virtual scene 410307, including but not limited to GPS and orientation data.

Manual alignment of the section of the virtual scene 410307 can be done by the user 410301 using the input of the computing device 410302, allowing the user 410301 to adjust the section shown 410307 of the overall virtual scene 410306. The manual alignment can replace the automatic alignment and therefore also support computing devices 410302 without sensors 410102 410202. Manual alignment is also used by the user 410301 if the metadata of the virtual content 410304 is not available or incorrect. Alternatively, the manual alignment can be used in addition to the automatic alignment. The benefits of manual alignment after automatic alignment include but are not limited to correcting distortions by the physical scene and correcting tracking errors of the sensors 410102 410202 of the computing device 410302.

The virtual content shown 410304, including but not limited to 360 images and videos, has stored the needed meta information, including the orientation to north 410309 of the camera when the data was captured. This meta information is then used to adjust the orientation of the virtual scene 410304 so that the orientation to north 410309 of the virtual scene 410304 aligns with the orientation to north 410308 of the physical scene 410306 where the user 410301 is standing.
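A minimal sketch of this north alignment is shown below: rotate the virtual scene about the vertical axis by the difference between the compass heading stored with the content and the heading measured in the physical scene. Angle conventions and function names are assumptions.

    import numpy as np

    def north_alignment_rotation(content_heading_deg, physical_heading_deg):
        """Headings measured clockwise from north; returns a 3x3 rotation about
        the vertical axis to apply to the virtual scene so its stored north
        lines up with the physical north."""
        delta = np.deg2rad(physical_heading_deg - content_heading_deg)
        c, s = np.cos(delta), np.sin(delta)
        return np.array([[c, 0.0, s],
                         [0.0, 1.0, 0.0],
                         [-s, 0.0, c]])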

If this virtual content 410304 lacks the needed meta information, like the orientation to north 410309, it is rotated manually by the user 410301 so that it aligns with the physical scene 410306.

CONCLUSION

As disclosed herein, features consistent with the present embodiments may be implemented via computer hardware, software and/or firmware. For example, the systems and methods disclosed herein may be embodied in various forms including, for example, a data processor, such as a computer that also includes a database, digital electronic circuitry, firmware, software, computer networks, servers, or in combinations of them. Further, while some of the disclosed implementations describe specific hardware components, systems and methods consistent with the innovations herein may be implemented with any combination of hardware, software and/or firmware. Moreover, the above-noted features and other aspects and principles of the innovations herein may be implemented in various environments. Such environments and related applications may be specially constructed for performing the various routines, processes and/or operations according to the embodiments, or they may include a general-purpose computer or computing platform selectively activated or reconfigured by code to provide the necessary functionality. The processes disclosed herein are not inherently related to any particular computer, network, architecture, environment, or other apparatus, and may be implemented by a suitable combination of hardware, software, and/or firmware. For example, various general-purpose machines may be used with programs written in accordance with teachings of the embodiments, or it may be more convenient to construct a specialized apparatus or system to perform the required methods and techniques.

Aspects of the method and system described herein, such as the logic, may be implemented as functionality programmed into any of a variety of circuitry, including programmable logic devices ("PLDs"), such as field programmable gate arrays ("FPGAs"), programmable array logic ("PAL") devices, electrically programmable logic and memory devices and standard cell-based devices, as well as application specific integrated circuits. Some other possibilities for implementing aspects include: memory devices, microcontrollers with memory (such as EEPROM), embedded microprocessors, firmware, software, etc. Furthermore, aspects may be embodied in microprocessors having software-based circuit emulation, discrete logic (sequential and combinatorial), custom devices, fuzzy (neural) logic, quantum devices, and hybrids of any of the above device types. The underlying device technologies may be provided in a variety of component types, e.g., metal-oxide semiconductor field-effect transistor ("MOSFET") technologies like complementary metal-oxide semiconductor ("CMOS"), bipolar technologies like emitter-coupled logic ("ECL"), polymer technologies (e.g., silicon-conjugated polymer and metal-conjugated polymer-metal structures), mixed analog and digital, and so on.

It should also be noted that the various logic and/or functions disclosed herein may be enabled using any number of combinations of hardware, firmware, and/or as data and/or instructions embodied in various machine-readable or computer-readable media, in terms of their behavioral, register transfer, logic component, and/or other characteristics. Computer-readable media in which such formatted data and/or instructions may be embodied include, but are not limited to, non-volatile storage media in various forms (e.g., optical, magnetic or semiconductor storage media) and carrier waves that may be used to transfer such formatted data and/or instructions through wireless, optical, or wired signaling media or any combination thereof. Examples of transfers of such formatted data and/or instructions by carrier waves include, but are not limited to, transfers (uploads, downloads, e-mail, etc.) over the Internet and/or other computer networks via one or more data transfer protocols (e.g., HTTP, FTP, SMTP, and so on).

Unless the context clearly requires otherwise, throughout the description and the claims, the words "comprise," "comprising," and the like are to be construed in an inclusive sense as opposed to an exclusive or exhaustive sense; that is to say, in a sense of "including, but not limited to." Words using the singular or plural number also include the plural or singular number respectively. Additionally, the words "herein," "hereunder," "above," "below," and words of similar import refer to this application as a whole and not to any particular portions of this application. When the word "or" is used in reference to a list of two or more items, that word covers all of the following interpretations of the word: any of the items in the list, all of the items in the list and any combination of the items in the list.

Although certain presently preferred implementations of the descriptions have been specifically described herein, it will be apparent to those skilled in the art to which the descriptions pertain that variations and modifications of the various implementations shown and described herein may be made without departing from the spirit and scope of the embodiments. Accordingly, it is intended that the embodiments be limited only to the extent required by the applicable rules of law.

The present embodiments can be embodied in the form of methods and apparatus for practicing those methods. The present embodiments can also be embodied in the form of program code embodied in tangible media, such as floppy diskettes, CD-ROMs, hard drives, or any other machine-readable storage medium, wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the embodiments. The present embodiments can also be in the form of program code, for example, whether stored in a storage medium, loaded into and/or executed by a machine, or transmitted over some transmission medium, such as over electrical wiring or cabling, through fiber optics, or via electromagnetic radiation, wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the embodiments. When implemented on a general-purpose processor, the program code segments combine with the processor to provide a unique device that operates analogously to specific logic circuits.

The software is stored in a machine-readable medium that may take many forms, including but not limited to a tangible storage medium, a carrier wave medium or a physical transmission medium. Non-volatile storage media include, for example, optical or magnetic disks, such as any of the storage devices in any computer(s) or the like. Volatile storage media include dynamic memory, such as the main memory of such a computer platform. Tangible transmission media include coaxial cables, copper wire and fiber optics, including the wires that comprise a bus within a computer system. Carrier-wave transmission media can take the form of electric or electromagnetic signals, or acoustic or light waves such as those generated during radio frequency (RF) and infrared (IR) data communications. Common forms of computer-readable media therefore include, for example: disks (e.g., hard, floppy, flexible) or any other magnetic medium, a CD-ROM, DVD or DVD-ROM, any other optical medium, any other physical storage medium, a RAM, a PROM and EPROM, a FLASH-EPROM, any other memory chip, a carrier wave transporting data or instructions, cables or links transporting such a carrier wave, or any other medium from which a computer can read programming code and/or data. Many of these forms of computer-readable media may be involved in carrying one or more sequences of one or more instructions to a processor for execution.

The foregoing description, for purposes of explanation, has been described with reference to specific embodiments. However, the illustrative discussions above are not intended to be exhaustive or to limit the embodiments to the precise forms disclosed. Many modifications and variations are possible in view of the above teachings. The embodiments were chosen and described in order to best explain the principles of the embodiments and their practical applications, to thereby enable others skilled in the art to best utilize the various embodiments with various modifications as are suited to the particular use contemplated.

Various examples are set out in the following numbered paragraphs:

NP1. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a 3D image data; mapping a position in the image data; mapping markers in the image data; and inserting an object to the marked position in the image data.

NP2. The method of NP1 wherein the inserting an object is by selecting the object from a pre-defined list.

NP3. The method of NP1 or NP2 wherein the inserting an object is by selecting the object from a newly created list.

NP4. The method of any one of NP1-NP3 further comprising, by the computer, appointing a trigger with the object to cause an event.

NP5. The method of NP4 wherein the trigger is a click or motion detection.

NP6. The method of NP4 or NP5 wherein the event is at least one of dialing a phone number, opening a web page, transferring to a different scene, displaying a text box, sending an e-mail and playing a sound.

NP7. The method of any one of NP4-NP6 wherein the event is at least one of causing display of a new object or obscuring an object with an invisible object.

NP8. The method of any one of NP1-NP7 further comprising, by the computer, receiving a second 3D image data; mapping a position in the second image data; mapping markers in the second image data; storing the mapped image data and the second mapped image data in a data storage; indicating the relationship of the image data and second image data as linked scenes.

NP9. The method of any one of NP1-NP8 wherein the objects are animated objects.

NP10. The method of any one of NP1-NP9 wherein the objects are received over the network.

NP11. The method of any one of NP1-NP10 wherein the objects are selected from a predefined set of objects.

NP12. The method of any one of NP1-NP11 wherein the image data is a 360 degree image.

NP13. A method of creating a virtual reality scene, comprising, by a computer with a processor and memory, receiving an image data over a network; estimating a depth map of a keyframe of the received image data using estimated depth values of pixels in the keyframe; generating a point cloud using the estimated depth map of the keyframe; and generating a 3D mesh using the generated point cloud.

NP14. The method of NP13, further comprising, by the computer, receiving a second image data over the network; receiving second tracking markers for the second image data; receiving second objects for the second image data; causing display of the image, the second image data and the received second objects using the second tracking markers.

NP15. The method of NP13 or NP14 wherein the received image data is two or three dimensional image data.

NP16. The method of any one of NP13-NP15 further comprising, by the computer, projecting a texture on the generated 3D mesh for display.

NP17. The method of any one of NP13-NP16 further comprising, by the computer, inserting an object occlusion into the image data.

NP18. The method of any one of NP13-NP17 wherein the image data is mapped to a floor plan.

NP19. The method of any one of NP13-NP18 wherein the image data is included in a timeline.

NP20. The method of any one of NP13-NP19 further comprising, by the computer, adding annotations including at least one of text, drawings, and 3D images.

NP21. The method of any one of NP13-NP20 further comprising, by the computer, causing live streaming display of the image data over the network.

NP22. The method of NP14 further comprising, by the computer, causing display of the image on multiple user displays.
NP23. The method of NP14 or NP22 further comprising, by the computer, detecting facial features in the image.

NP24. The method of any one of NP14, NP22 and NP23 further comprising, by the computer, causing display of a list of image scenes and features to allow addition of new image scenes.

NP25. The method of any one of NP14 and NP22-NP24 further comprising, by the computer, causing display of an edit area for a currently selected scene.

NP26. The method of any one of NP14 and NP22-NP25 further comprising, by the computer, inserting an object into the image data at a marked position.

NP27. The method of NP26 further comprising, by the computer, causing animation of the inserted object.

NP28. The method of NP26 or NP27 further comprising, by the computer, associating a triggerable command with the inserted object, activated by a user in the image scene by an interaction.

NP29. The method of NP28 wherein the triggerable command is opening a web page or transferring to a different scene.

NP30. The method of NP28 wherein the triggerable command is causing display of a text box or playing a sound.

NP31. The method of NP28 wherein the triggerable command is sending an email.

NP32. The method of any one of NP28-NP31 wherein the interaction is a click.

NP33. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a first 3D image data, wherein the image data includes a first timestamp; mapping a position in the image data; mapping markers in the image data; inserting an object to the marked position in the image data; receiving a second 3D image data, wherein the image data includes a second timestamp; and causing display of the first 3D image and the second 3D image according to the first and second timestamps.

NP34. The method of NP33 wherein the first 3D image scene is a parent scene and the second 3D image scene is a sub child scene.

NP35. The method of NP33 or NP34, wherein the first timestamp is metadata included in the first 3D image data.

NP36. The method of any one of NP33-NP35, wherein the first 3D image data is a 360 degree image and the computer is further configured to apply the first 3D image as texture to a sphere object for display.

NP37. The method of NP36 wherein the orientation of the first 3D image may be adjusted for display.

NP38. The method of NP36 or NP37 further comprising, by the computer, slicing the 3D image data into parts to load independently for display.

NP39. The method of any one of NP33-NP38 wherein the first 3D image is a video.

NP40. The method of any one of NP33-NP39 further comprising, by the computer, causing display of an edit layout for a user to view, preview, edit and arrange image scenes.

NP41. The method of NP40 wherein the edit layout includes image caption and timestamp of an image scene.

NP42. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a 3D image data; mapping a position in the image data; mapping markers in the image data; inserting an object to the marked position in the image data; receiving a floor plan for the image data; and receiving placement of the received 3D image onto a position in the floor plan.

NP43. The method of NP42 further comprising, by the computer, receiving an image filter for the 3D image data; applying the image filter to the received floor plan.

NP44. The method of NP42 or NP43 wherein the floor plan is at least one of a computer aided design file, a pdf file, or an online map.
NP45. The method of NP44 wherein the placement of the received 3D image onto a position in the floor plan is through a plugin.

NP46. The method of any one of NP42-NP45 further comprising, by the computer, receiving an indication of a hotspot on the floor plan; and causing display of an icon on the hotspot on the floor plan.

NP47. The method of any one of NP42-NP46 further comprising, by the computer, receiving a second floor plan and stacking the second floor plan.

NP48. The method of NP46 or NP47 further comprising, by the computer, associating a triggerable command with the hotspot, activated by a user in the image scene by an interaction.

NP49. The method of any one of NP46-NP48 further comprising, by the computer, moving the hotspot on the floor plan by a click-and-drag operation from the user.

NP50. The method of NP48 or NP49 wherein the triggerable command is navigation to another image scene.

NP51. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a first 3D image data, wherein the first 3D image data is a 360 degree image data and the computer is further configured to apply the first 3D image data as texture to a sphere object for display, and wherein the first 3D image data includes a first directional indicator corresponding to a direction the image was originally taken; mapping a position in the first 3D image data; mapping markers in the first 3D image data; receiving a second 3D image data, wherein the second 3D image data includes a second directional indicator corresponding to a direction the image was originally taken; and using the first and second directional indicators to orient the first and second 3D image data for display.

NP52. The method of NP51 wherein the using of the first and second directional indicators to orient the first and second 3D image data for display includes using a virtual camera and an angle between the virtual camera and the directional indicator.

NP53. The method of NP51 or NP52 wherein the first 3D image data includes a timestamp.

NP54. The method of NP53 wherein the first 3D image data includes a waypoint location identifier.

NP55. The method of NP54 wherein the second 3D image data includes a waypoint and a timestamp, further comprising using the timestamp and waypoint of the first 3D image data to correlate the waypoint and timestamp of the second 3D image data.

NP56. The method of NP55 further comprising, by the computer, receiving annotation information for the first 3D image data and correlating the received annotation information with the first 3D image data for display.

NP57. The method of NP55 or NP56 further comprising, by the computer, receiving audio information for the first 3D image data and correlating the received audio information with the first 3D image data for display and playback.

NP58. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a 3D image data, wherein the 3D image data is a 360 degree image data and the computer is further configured to apply the 3D image data as texture to a sphere object for display; mapping a first and a second position in the 3D image data; and calculating distances between the first and second positions in the display of the 3D image data.

NP59. The method of NP58 further comprising, by the computer, mapping objects in the display of the 3D image data; and calculating angles of objects in the display of the 3D image data.
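NP51 and NP52 above orient linked panoramas using directional indicators and an angle relative to a virtual camera. The sketch below reduces this to yaw (heading) angles only, which is an assumption for illustration; the function name and the example headings are hypothetical.

```python
def yaw_offset_to_align(camera_yaw_deg, indicator_yaw_deg):
    """Angle (degrees) to rotate a panorama sphere so its capture direction
    lines up with the current virtual camera heading.

    camera_yaw_deg: heading of the virtual camera in scene coordinates.
    indicator_yaw_deg: directional indicator stored with the panorama,
        i.e. the heading the camera faced when the image was originally taken.
    """
    # Wrap the difference into (-180, 180] so the sphere takes the short way round.
    return (camera_yaw_deg - indicator_yaw_deg + 180.0) % 360.0 - 180.0

# Applying the same rule to two linked panoramas keeps "forward" consistent
# when a viewer transitions from the first scene to the second.
first_offset = yaw_offset_to_align(camera_yaw_deg=30.0, indicator_yaw_deg=90.0)
second_offset = yaw_offset_to_align(camera_yaw_deg=30.0, indicator_yaw_deg=250.0)
```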
NP60. The method of NP58 or NP59 further comprising, by the computer, receiving a floor plan for the image data; receiving placement of the received 3D image data onto a position in the floor plan; and calculating distances between a first and a second position in the floor plan.

NP61. The method of any one of NP58-NP60 further comprising, by the computer, adding boundaries of a canvas object to the 3D image data, the boundaries including borders of the canvas object.

NP62. The method of NP61 further comprising, by the computer, adding boundaries of a second canvas object to the 3D image data, the boundaries of the second canvas object including borders of the second canvas object; and calculating distances between a first and a second position on the canvas object and the second canvas object.

NP63. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a 3D image data, wherein the 3D image data is a 360 degree image data and the computer is further configured to apply the 3D image data as texture to a sphere object for display; and causing display of an overlay over the 3D image data, wherein the overlay includes a web page.

NP64. The method of NP63 further comprising, by the computer, resizing the texture for display on a mobile device.

NP65. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a 3D image data, wherein the 3D image data is a 360 degree image data and the computer is further configured to apply the 3D image data as texture to a sphere object for display; mapping a marker in the 3D image data at a position; inserting an object at the marked position in the image data; and adding an annotation to the object.

NP66. The method of NP65 further comprising, by the computer, receiving a selection of the added annotation from a list of annotations; and causing display of the 3D image data with the corresponding added annotation.

NP67. The method of NP66 wherein the display of the 3D image data with the corresponding added annotation focuses on the annotated object.

NP68. The method of NP66 or NP67 wherein the annotated object may be interacted with by a user.

NP69. The method of NP68 wherein the list of annotations includes a status update after the annotated object was interacted with by a user.

NP70. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a 3D image data, wherein the 3D image data is a 360 degree image data and the computer is further configured to apply the 3D image data as texture to a sphere object for display; and providing painting tools for graphic addition to the 3D image data.

NP71. The method of NP70 wherein the painting tools include input from hardware user interfaces.

NP72. The method of NP71 wherein the hardware user interface is at least one of a computer mouse, a hand held pointer, a joystick, or a touch screen.

NP73. The method of any one of NP70-NP72 wherein the painting tools include free form painting tools.

NP74. The method of any one of NP70-NP73 wherein the painting tools include pre-defined geometric shapes.
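NP60 above measures distances between positions placed on a floor plan. One simple way this could be computed is shown below, assuming the floor plan carries (or a user supplies) a meters-per-pixel scale; that scale and the helper name are assumptions of this sketch.

```python
import math

def floor_plan_distance(p1, p2, meters_per_pixel):
    """Distance between two positions placed on a floor plan.

    p1, p2: (x, y) pixel coordinates of the two placed positions.
    meters_per_pixel: scale of the floor plan image; assumed to come from
        the plan's metadata or a user-supplied calibration.
    """
    dx, dy = p2[0] - p1[0], p2[1] - p1[1]
    return math.hypot(dx, dy) * meters_per_pixel
```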
NP75. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a 3D image data, wherein the 3D image data is a 360 degree image data and the computer is further configured to apply the 3D image data as texture to a sphere object for display; and providing input for multiple users in the display of the 3D image data.

NP76. The method of NP75 wherein the display includes avatars of the multiple users.

NP77. The method of NP76 wherein the avatars in the display are oriented in the 3D image data according to data received by the computer from their corresponding user hardware indicating orientation.

NP78. The method of any one of NP75-NP77 wherein the multiple users are remotely located from one another and interact with the server computer over a network.

NP79. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a 3D image data, wherein the 3D image data is a 360 degree image data and the computer is further configured to apply the 3D image data as texture to a sphere object for display; and positioning an audio source in the 3D image data, wherein the positioned audio source in the 3D image data is played for users using multiple channels.

NP80. The method of NP79 wherein the position of the audio source is able to be moved relative to the 3D image data.

NP81. The method of NP79 or NP80, wherein the audio source is triggered by a user interaction.

NP82. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a 3D image data, wherein the 3D image data is a 360 degree image data and the computer is further configured to apply the 3D image data as texture to a sphere object for display; and causing display of the 3D image data to a user in a data stream over a network.

NP83. The method of NP82 wherein the 3D image data is from a 360 degree camera.

NP84. The method of NP82 or NP83 further comprising, by the computer, causing display of a placeholder image if the data stream is interrupted.

NP85. The method of any one of NP82-NP84 further comprising, by the computer, causing display of a zoomed portion of the 3D image upon interaction by a user.

NP86. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a 3D image data; mapping a position in the image data; mapping markers in the image data; inserting an object at the marked position in the image data; and causing a distortion of the 3D image upon selection by a user.

NP87. The method of NP86 wherein the distortion is a fisheye distortion configured to allow a user to focus on an object in the 3D image.

NP88. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a 3D image data; mapping a position in the image data; mapping markers in the image data; and analyzing and identifying the 3D image data for facial features.

NP89. The method of NP88, further comprising, by the computer, blurring an identified facial feature in a display of the 3D image data.
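NP79 above positions an audio source in the 3D image data and plays it over multiple channels. A minimal two-channel (stereo) sketch follows, using constant-power panning driven by the yaw angle between the viewer and the source; treating positive azimuth as the viewer's right, and the function name, are assumptions of this example, and a full renderer would more likely use HRTFs or more channels.

```python
import math

def stereo_gains(source_yaw_deg, listener_yaw_deg):
    """Left/right channel gains for an audio source positioned in a 360 scene.

    source_yaw_deg: direction of the audio source within the panorama.
    listener_yaw_deg: current heading of the viewer's virtual camera.
    Constant-power panning keeps overall loudness roughly even as the viewer turns.
    """
    azimuth = math.radians((source_yaw_deg - listener_yaw_deg + 180.0) % 360.0 - 180.0)
    pan = math.sin(azimuth)                 # -1 = hard left, +1 = hard right (assumed convention)
    angle = (pan + 1.0) * math.pi / 4.0     # map pan to [0, pi/2]
    return math.cos(angle), math.sin(angle)  # (left_gain, right_gain)
```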
NP90. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a first resolution 3D image data; receiving a second resolution 3D image data; segmenting the first and second 3D image data into segments; and causing display of an image using both segments from the first resolution 3D image data and segments from the second resolution 3D image data.

NP91. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a 3D image data, wherein the 3D image data is a 360 degree image data and the computer is further configured to apply the 3D image data as texture to a sphere object for display; and applying a rotation to the 3D image data for display.

NP92. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a first 3D image data, wherein the first 3D image data is a 360 degree image data and the computer is further configured to apply the first 3D image data as texture to a sphere object for display; receiving a second 3D image data; comparing the first 3D image data and the second 3D image data to correlate features common to both; calculating depth of the correlated features of the first and second 3D images; and using the calculated depth of the correlated features to render a stereoscopic image display.

NP93. The method of NP92 further comprising, by the computer, applying a filter to the first 3D image data and the second 3D image data; and merging the first and second filtered 3D image data for display.

NP94. The method of NP93 wherein the filter removes moving objects.

NP95. The method of NP93 wherein the filter removes changing light conditions.

NP96. A method for creating a virtual reality scene, the method comprising: by a server computer with a processor and a memory, receiving a 3D image video data, wherein the 3D image video data is a 360 degree image data and the computer is further configured to apply the 3D image video data as texture to a sphere object for display; analyzing the 3D image video to identify an object; and tracking the identified object in the 3D image video.

NP97. The method of NP96 further comprising, by the computer, determining a position of a camera used to capture the received 3D image video data; and using the determined position of the camera and the identified object to track the identified object.

NP98. The method of NP96 or NP97 wherein the analysis of the 3D image video is by a visual search engine.

NP99. The method of NP97 or NP98 further comprising, by the computer, updating an absolute pose of the object.

NP100. The method of any one of NP96-NP99, further comprising, by the computer, estimating a depth map of a keyframe of the received 3D image video data using estimated depth values of pixels in the keyframe; generating a point cloud using the estimated depth map of the keyframe; and generating a 3D mesh using the generated point cloud.

NP101. The method of NP100 further comprising, by the computer, extracting keypoints from the keyframe; and using the extracted keypoints in the tracking of the identified object.

NP102. The method of NP100 or NP101 further comprising, by the computer, providing tools for annotating the textured mesh.

NP103. The method of NP101 or NP102, wherein the textured mesh is used for shadow casting in the 3D image video data.
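NP93 and NP94 above describe filtering and merging two captures so that moving objects are removed. One common approach, sketched below on the assumption that the captures are already registered to the same orientation, is a per-pixel median across the stack of captures; the helper name is illustrative.

```python
import numpy as np

def remove_moving_objects(frames):
    """Merge several aligned captures of the same scene into one static image.

    frames: list of (H, W, C) arrays taken from the same viewpoint.
    A per-pixel median keeps the value most frames agree on, so objects that
    move between captures (people, vehicles) tend to be filtered out.
    """
    stack = np.stack(frames, axis=0).astype(np.float32)
    return np.median(stack, axis=0).astype(frames[0].dtype)
```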

1-31. (canceled)
 32. A method of creating an augmented reality scene, comprising, by a computing device with a processor and memory, receiving image data over a network, the image data being generated from a camera and including multiple frames; estimating a depth map of a keyframe of the multiple frames of the received image data using estimated depth values of pixels in the keyframe; generating a point cloud using the estimated depth map of the keyframe; and generating a 3D mesh using the generated point cloud; wherein the 3D mesh includes multiple keyframe images, wherein the multiple keyframe images are overlapping keyframes merged to a single texture.
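As one illustrative reading of claim 32, an organized keyframe depth map can be turned into a point cloud and a simple triangle mesh by back-projecting each pixel and connecting neighboring pixels. The pinhole intrinsics, the max_jump discontinuity threshold, and the helper name are assumptions of this sketch, not limitations of the claim.

```python
import numpy as np

def organized_depth_to_mesh(depth, fx, fy, cx, cy, max_jump=0.1):
    """Build a simple triangle mesh from an organized keyframe depth map.

    Each pixel is back-projected with the pinhole model, and neighboring
    pixels are connected into two triangles per 2x2 block, skipping blocks
    whose depths differ by more than max_jump (likely depth discontinuities).
    Returns (vertices, triangles) as numpy arrays.
    """
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    z = depth
    verts = np.stack([(u - cx) * z / fx, (v - cy) * z / fy, z], axis=-1).reshape(-1, 3)

    tris = []
    idx = lambda r, c: r * w + c
    for r in range(h - 1):
        for c in range(w - 1):
            d = depth[r:r + 2, c:c + 2]
            if (d > 0).all() and d.max() - d.min() < max_jump:
                tris.append([idx(r, c), idx(r + 1, c), idx(r, c + 1)])
                tris.append([idx(r + 1, c), idx(r + 1, c + 1), idx(r, c + 1)])
    return verts, np.array(tris, dtype=np.int64).reshape(-1, 3)
```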
 33. The method of claim 32 wherein the keyframe is a frame with a depth map and a position.
 34. The method of claim 33 further comprising, by the computer, for non-keyframe frames, calculating a relative position to a keyframe using the depth map and position of the keyframe; and refining the keyframe depth map.
 35. The method of claim 34, wherein the 3D mesh is generated by, computing a normal vector for each point in the point cloud, based on neighboring points; orienting the computed normal vector of each point toward the camera pose of the keyframe that the point belongs to.
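Claim 35 computes a normal per point from its neighbors and orients it toward the keyframe's camera pose. A brute-force sketch of that computation follows; the neighborhood size k and the use of a covariance eigenvector are implementation assumptions (a KD-tree and a weighted plane fit would be typical in practice).

```python
import numpy as np

def estimate_oriented_normals(points, camera_center, k=12):
    """Per-point normals from neighboring points, flipped toward the keyframe camera.

    points: (N, 3) point cloud generated from a keyframe depth map.
    camera_center: (3,) position of that keyframe's camera pose.
    k: number of nearest neighbors used to fit a local plane.
    Uses a brute-force neighbor search for clarity only.
    """
    normals = np.zeros_like(points)
    for i, p in enumerate(points):
        # k nearest neighbors of p (brute force; includes p itself)
        nbrs = points[np.argsort(np.linalg.norm(points - p, axis=1))[:k]]
        # Normal = eigenvector of the neighborhood covariance with the smallest eigenvalue
        cov = np.cov((nbrs - nbrs.mean(axis=0)).T)
        normal = np.linalg.eigh(cov)[1][:, 0]
        # Orient toward the camera the point was observed from
        if np.dot(normal, camera_center - p) < 0:
            normal = -normal
        normals[i] = normal
    return normals
```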
 36. The method of claim 32 wherein the merging of multiple keyframes to a single texture is by weighting a keyframe distance to a mesh surface and a keyframe angle relative to the mesh surface.
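Claim 36 merges overlapping keyframes into a single texture by weighting each keyframe's distance to the mesh surface and its angle relative to the surface. A per-face weight under one hypothetical weighting (an exponential distance fall-off times a view-angle cosine) could be computed as follows; the exact weighting function, sigma_dist, and the helper name are assumptions of this sketch.

```python
import numpy as np

def keyframe_blend_weight(face_center, face_normal, cam_center, sigma_dist=1.0):
    """Blending weight of one keyframe for one mesh face when merging textures.

    Closer keyframes and keyframes that view the face head-on receive larger
    weights; colors from overlapping keyframes would then be combined with
    these weights normalized to sum to one per face.
    """
    to_cam = cam_center - face_center
    dist = np.linalg.norm(to_cam)
    # Angle term: how squarely the face is seen (0 when viewed edge-on or from behind).
    angle_term = max(0.0, float(np.dot(face_normal, to_cam / dist)))
    # Distance term: exponential fall-off with distance to the surface.
    dist_term = float(np.exp(-dist / sigma_dist))
    return angle_term * dist_term
```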
 37. The method of claim 36, further comprising, by the computer, receiving a second image data over the network, receiving tracking markers for the second image data; receiving objects for the second image data; causing display of, the image, the second image data and the received objects using the tracking markers.

 38. A system for creating an augmented reality scene, comprising, a computing device with a processor and memory, configured to, receive image data over a network, the image data being generated from a camera and including multiple frames; estimate a depth map of a keyframe of the multiple frames of the received image data using estimated depth values of pixels in the keyframe; generate a point cloud using the estimated depth map of the keyframe; and generate a 3D mesh using the generated point cloud; wherein the 3D mesh includes multiple keyframe images, wherein the multiple keyframe images are overlapping keyframes merged to a single texture.
 39. The system of claim 38 wherein the keyframe is a frame with a depth map and a position.
 40. The system of claim 39 wherein the computer is further configured to, for non-keyframe frames, calculate a relative position to a keyframe using the depth map and position of the keyframe; and refine the keyframe depth map.
 41. The system of claim 40, wherein the 3D mesh is generated by, the computer further configured to, compute a normal vector for each point in the point cloud, based on neighboring points; and orient the computed normal vector of each point toward the camera pose of the keyframe that the point belongs to.
 42. The system of claim 38 wherein the merging of multiple keyframes to a single texture is by weighting a keyframe distance to a mesh surface and a keyframe angle relative to the mesh surface.
 43. The system of claim 42, wherein the computer is further configured to, receive a second image data over the network, receive tracking markers for the second image data; receive objects for the second image data; cause display of, the image, the second image data and the received objects using the tracking markers.
 44. A non-transitory computer-readable medium having computer-executable instructions thereon for a method of creating an augmented reality scene, the method comprising: by a computing device with a processor and memory, receiving image data over a network, the image data being generated from a camera and including multiple frames; estimating a depth map of a keyframe of the multiple frames of the received image data using estimated depth values of pixels in the keyframe; generating a point cloud using the estimated depth map of the keyframe; and generating a 3D mesh using the generated point cloud; wherein the 3D mesh includes multiple keyframe images, wherein the multiple keyframe images are overlapping keyframes merged to a single texture.

 45. The non-transitory computer-readable medium of claim 44 wherein the keyframe is a frame with a depth map and a position.

 46. The non-transitory computer-readable medium of claim 45 further comprising, by the computer, for non-keyframe frames, calculating a relative position to a keyframe using the depth map and position of the keyframe; and refining the keyframe depth map.

 47. The non-transitory computer-readable medium of claim 46 wherein the 3D mesh is generated by, computing a normal vector for each point in the point cloud, based on neighboring points; orienting the computed normal vector of each point toward the camera pose of the keyframe that the point belongs to.

 48. The non-transitory computer-readable medium of claim 44, wherein the merging of multiple keyframes to a single texture is by weighting a keyframe distance to a mesh surface and a keyframe angle relative to the mesh surface.

 49. The non-transitory computer-readable medium of claim 48, further comprising, by the computer, receiving a second image data over the network, receiving tracking markers for the second image data; receiving objects for the second image data; causing display of, the image, the second image data and the received objects using the tracking markers.